misshiki's bookmarks - Hatena Bookmark

misshiki id:misshiki

misshiki's bookmarks (38,752)

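Anthropic study reveals when AI resorts to flattery such as "Your intuition is completely right" - GIGAZINE
When chatting with an AI, you sometimes get needless flattery in reply, such as "Your intuition is completely right" or "That's an incredibly sharp observation." AI company Anthropic collected responses from its own chat AI, Claude, analyzed the conditions under which the AI uses flattering phrases, and published the results. How people ask Claude for personal guidance \ Anthropic https://www.anthropic.com/research/claude-personal-guidance Users also turn to chat AI for advice on private matters such as asset management and life planning. If the AI is highly sycophantic, this can therefore lead to irreversible outcomes, such as telling a user who is about to quit their job without a plan "That's the right decision."
misshiki 2026/05/01
Anthropic analyzed 1 million Claude conversations. About 6% were requests for personal guidance, and sycophantic behavior appeared in 8.9% of those. Rates were high for spirituality (37.9%) and relationships (24.8%).

Anthropic

Artificial intelligence
Link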
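The percentages in the comment above can be turned into rough absolute counts. The sketch below is only back-of-the-envelope arithmetic: it assumes the 8.9% rate applies to the personal-guidance conversations as a whole, and because the summary says "about 6%", the results are approximate. The function name `estimate_counts` is hypothetical.

```python
def estimate_counts(total: int, personal_share: float, sycophancy_rate: float) -> tuple[int, int]:
    """Estimate how many conversations were personal guidance, and how many of those were sycophantic."""
    personal = round(total * personal_share)        # conversations asking for personal guidance
    sycophantic = round(personal * sycophancy_rate)  # subset showing sycophantic behavior
    return personal, sycophantic


if __name__ == "__main__":
    # Figures from the bookmark comment: 1 million conversations, ~6%, 8.9%.
    personal, sycophantic = estimate_counts(1_000_000, 0.06, 0.089)
    print(f"personal guidance: ~{personal:,}")
    print(f"sycophantic among those: ~{sycophantic:,}")
```

The per-domain rates (37.9% and 24.8%) cannot be converted the same way, because the comment does not say how many conversations fall into each domain.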
Claude Security is now in public beta | Claude
Claude Security is now available in public beta to Claude Enterprise customers. AI cybersecurity capabilities are advancing fast. Today’s models are already highly effective at finding flaws in software code; the next generation will be more capable still, and will be particularly effective at autonomously exploiting these flaws. Now is the time for organizations to act to improve their security,
misshiki 2026/05/01
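Claude Security has entered public beta for Enterprise customers. It scans code with Opus 4.7, reports vulnerabilities with confidence, severity, and reproduction steps, and also generates patch instructions. Scheduled scans and CSV/Markdown export are supported.

Security

Anthropic

Artificial intelligence
Link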
Qwen Studio
misshiki 2026/05/01
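Qwen has officially open-sourced FlashQLA, a high-performance linear attention kernel library. It optimizes GDN Chunked Prefill and, on Hopper, runs the forward pass 2-3x faster and the backward pass 2x faster than FLA. The gains are largest for pretraining and for agentic inference in edge environments.

Natural language processing
Link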
What’s New in Microsoft 365 Copilot | April 2026 | Microsoft Community Hub
misshiki 2026/05/01
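Microsoft 365 Copilot, April 2026 update: Plan mode and Python in Excel, Claude in Word, image model selection and Web grounding in PowerPoint, document and PowerPoint generation in Notebooks, call delegation in Teams, plus usage analytics for admins.

Microsoft

Artificial intelligence
Link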
https://openai.com/ja-JP/index/speeding-up-agentic-workflows-with-websockets/
- 1 user
- openai.com
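- Learning
misshiki 2026/05/01
OpenAI introduced a WebSocket mode for the Responses API. Persistent connections and state caching speed up agent loops by up to 40%. In Codex, GPT-5.3-Codex-Spark reached 1,000 TPS.

OpenAI

Artificial intelligence

Programming
Link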
OpenRouter on X: "alphaXiv x OpenRouter is live! When an AI paper mentions a model, alphaXiv now turns it into a preview: provider, description, use-case rankings, and a direct link to the OpenRouter model page. Go from research to model in one click with
- 1 user
- x.com
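- Learning
misshiki 2026/05/01
OpenRouter announced an alphaXiv integration. When an AI paper mentions a model, alphaXiv shows a preview with the provider, a description, use-case rankings, and a direct link to the OpenRouter model page.

Artificial intelligence

Programming
Link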
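Anthropic plans to expand access to "Mythos," including to Japan; US administration opposes - Nikkei
[Silicon Valley = Ryotaro Yamada, Washington = Ryohei Yasoshima] US startup Anthropic plans to expand access, including to Japan, to "Claude Mythos," the artificial intelligence (AI) model it has released on a limited basis to about 50 US companies and organizations, Nikkei has learned. The US administration is opposed. Anthropic developed Mythos as a high-performance AI, but the model is highly capable of identifying weaknesses in systems, and the risk if it were misused for cyberattacks is large
misshiki 2026/05/01
Anthropic plans to expand access, including to Japan, to Claude Mythos, which it has released on a limited basis to about 50 US companies and organizations. The US administration is opposed. The model is highly capable of identifying weaknesses, raising concerns about misuse.

Anthropic

Artificial intelligence

Security
Link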
Continually improving our agent harness · Cursor
We approach building the Cursor agent harness the way we'd approach any ambitious software product. Much of the work is vision-driven, where we start with an opinion about what the ideal agent experience should look like. From there, we form hypotheses about how to get closer to that vision, run experiments to test them, and iterate using quantitative and qualitative signals from evals and real us
misshiki 2026/05/01
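Cursor explains how it improves its agent harness. It evaluates changes with CursorBench, A/B tests, and Keep Rate, and has raised tool reliability above 99%. The post also covers per-model tool formats, switching models mid-session, and subagent support.

Cursor
Link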
https://x.com/ManusAI/status/2049870078896963962
- 1 user
- x.com
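- Society
misshiki 2026/05/01
Manus announced Cloud Computer, an always-on machine in the cloud that keeps running 24/7 even when your laptop is off. It can run bots, databases, knowledge bases, OSS tools, scheduled scrapers, and more without code.

Artificial intelligence
Link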
Google AI Developers on X: "Now that Gemini Embedding 2 is GA, let’s explore what the model unlocks — from agentic multimodal RAG to visual search — as it maps text, images, video, audio, and documents into a unified embedding space. https://t.co/HNUsECL2
- 1 user
- x.com
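- Society
misshiki 2026/05/01
Gemini Embedding 2 is GA. It embeds text, images, video, audio, and PDFs into a single space and supports agentic RAG, visual search, reranking, classification, and more. With MRL, the 3072-dimensional embeddings can be reduced to 1536 or 768 dimensions.

Gemini

Natural language processing
Link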
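For the MRL (Matryoshka Representation Learning) point in the comment above, the usual way to shrink such an embedding is to keep the leading dimensions and re-normalize to unit length. The sketch below shows that generic practice in plain Python; it does not call any Gemini API, and the helper name `truncate_embedding` is hypothetical.

```python
import math


def truncate_embedding(vec: list[float], dim: int) -> list[float]:
    """Keep the first `dim` components of an MRL-style embedding and L2-normalize the result."""
    if not 0 < dim <= len(vec):
        raise ValueError("dim must be between 1 and len(vec)")
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head  # nothing to normalize for an all-zero prefix
    return [x / norm for x in head]


if __name__ == "__main__":
    # Stand-in for a 3072-dimensional embedding; the values are made up.
    full = [math.sin(i) for i in range(3072)]
    for dim in (1536, 768):
        small = truncate_embedding(full, dim)
        print(dim, len(small), round(sum(x * x for x in small), 6))
```

Re-normalizing matters when similarity is computed as a dot product, since the truncated prefix is no longer unit length on its own.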
https://x.com/thsottiaux/status/2049710850882380004
- 1 user
- x.com
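- Society
misshiki 2026/05/01
The post asks people to send Codex feature requests in the form of "images generated with images 2.0." It says this makes accepted requests easier for Codex to implement, and that there are already several good ideas that Codex is working on.

Codex
Link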
Claude on X: "Claude Security is now in public beta for Claude Enterprise customers. Claude scans your codebase for vulnerabilities, validates each finding to cut false positives, and suggests patches you can review and approve. https://t.co/neYmbGYeRz"
- 1 user
- x.com
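- Society
misshiki 2026/05/01
Claude Security has entered public beta for Enterprise customers. It scans the codebase, detects vulnerabilities, reduces false positives, and proposes patches that can be reviewed. No API integration or agent building is required.

Artificial intelligence

Security

Anthropic
Link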
Qwen Studio
misshiki 2026/05/01
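"We are pleased to announce Qwen-Scope, an interpretability toolkit trained on the Qwen3 and Qwen3.5 series models. Specifically, we trained sparse autoencoders (SAEs) inserted into Qwen's hidden layers."

Artificial intelligence

Natural language processing
Link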
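As background for the quote above, a sparse autoencoder maps a hidden-state vector to a wider, mostly-zero feature vector and then reconstructs the original from it. The sketch below is a generic textbook-style forward pass with tiny hand-picked weights; it is not Qwen-Scope's architecture or API, and every name and number in it is an assumption for illustration.

```python
def matvec(W: list[list[float]], x: list[float]) -> list[float]:
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def sae_forward(x, W_enc, b_enc, W_dec, b_dec):
    """Encode x into non-negative sparse features with ReLU, then decode back."""
    pre = [a + b for a, b in zip(matvec(W_enc, x), b_enc)]
    features = [max(0.0, p) for p in pre]  # ReLU zeroes out inactive features
    recon = [a + b for a, b in zip(matvec(W_dec, features), b_dec)]
    return features, recon


# Toy weights: 2-dimensional "hidden state", 4 features (one per signed axis direction).
W_ENC = [[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0]]
B_ENC = [0.0, 0.0, 0.0, 0.0]
W_DEC = [[1.0, -1.0, 0.0, 0.0], [0.0, 0.0, 1.0, -1.0]]
B_DEC = [0.0, 0.0]

if __name__ == "__main__":
    features, recon = sae_forward([0.5, -2.0], W_ENC, B_ENC, W_DEC, B_DEC)
    print("active features:", [i for i, f in enumerate(features) if f > 0])
    print("reconstruction:", recon)
```

In a trained SAE the weights are learned with a reconstruction loss plus a sparsity penalty; here they are fixed by hand so that only two of the four features fire for any input.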
Two Heads Are Better Than One: Async Knowledge Injection for Speech AI with Tandem Architecture
We're excited to introduce KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI. KAME means turtle in Japanese. Paper (arxiv): https://arxiv.org/abs/2510.02327 (Accepted at ICASSP 2026) Infe
misshiki 2026/05/01
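Sakana AI introduces KAME. The S2S model responds immediately while a backend LLM asynchronously injects oracle signals, moving from "think then speak" to "speak while thinking."

Natural language processing

Speech processing
Link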
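The "speak while thinking" idea in the comment above can be illustrated with a toy asyncio simulation: a front end keeps emitting tokens while a slower backend task works in parallel, and the front end starts conditioning on the backend's result as soon as it arrives. This is only a sketch of the general pattern, not Sakana AI's implementation; the function names, the string tokens, and the `think_steps` parameter are all hypothetical.

```python
import asyncio


async def backend_llm(query: str, oracle_queue: asyncio.Queue, think_steps: int) -> None:
    """Simulated slow backend: 'thinks' for a few scheduler turns, then publishes an oracle signal."""
    for _ in range(think_steps):
        await asyncio.sleep(0)  # yield control, standing in for real inference latency
    await oracle_queue.put(f"oracle({query})")


async def speak_while_thinking(query: str, n_tokens: int = 5, think_steps: int = 2) -> list[str]:
    """Simulated front end: emits tokens immediately and picks up the oracle once it is available."""
    oracle_queue: asyncio.Queue = asyncio.Queue()
    task = asyncio.create_task(backend_llm(query, oracle_queue, think_steps))
    oracle = None
    spoken = []
    for i in range(n_tokens):
        try:
            oracle = oracle_queue.get_nowait()  # non-blocking check for injected knowledge
        except asyncio.QueueEmpty:
            pass
        spoken.append(f"tok{i}" + ("+oracle" if oracle else ""))
        await asyncio.sleep(0)  # give the backend a chance to run
    await task
    return spoken


if __name__ == "__main__":
    print(asyncio.run(speak_while_thinking("what is KAME?")))
```

The front end never blocks on the backend, so the first tokens go out without the oracle and later tokens include it, which is the contrast with a "think then speak" pipeline that would wait for the backend before emitting anything.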
https://x.com/SakanaAILabs/status/2049711545182290301
- 1 user
- x.com
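- Society
misshiki 2026/05/01
Sakana AI announced the speech AI "KAME." The S2S model responds immediately while a backend LLM reasons asynchronously and injects oracle signals, yielding a speech AI that "thinks while speaking." Accepted at ICASSP 2026.

Natural language processing

Speech processing
Link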
https://x.com/Alibaba_Qwen/status/2049861145574690992
- 1 user
- x.com
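- Society
misshiki 2026/05/01
Qwen released Qwen-Scope, a set of SAEs for Qwen3/3.5. Qwen says it can be used for inference steering, data classification and synthesis, tracking problems during training, and selecting evaluation benchmarks.

Artificial intelligence

Natural language processing
Link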
Windsurf on X: "We've partnered with @OpenAI to offer GPT-5.5 in Windsurf at 50% off through May 14 starting today."
- 1 user
- x.com
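- Learning
misshiki 2026/05/01
Windsurf announced that it has partnered with OpenAI to offer GPT-5.5 in Windsurf at 50% off through May 14. On April 25 it had also announced that GPT-5.5 was available in Windsurf 2.0.

Artificial intelligence

Programming
Link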
https://x.com/zento_ai/status/2049720601963790606
- 1 user
- x.com
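- Society
misshiki 2026/05/01
A post on the current behavior of Claude Opus 4.7, rating its lengthy requirement confirmations and detailed checks as "persistent but trustworthy." It also mentions finding bugs in parts implemented by GPT-5.5.

Claude 4
Link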
https://openai.com/index/advanced-account-security/
misshiki 2026/05/01
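OpenAI announced Advanced Account Security: passkeys or physical security keys are required, password login and SMS/email recovery are disabled, and accounts are automatically excluded from training use. It becomes mandatory for some Cyber participants from June 1, 2026.

OpenAI

Artificial intelligence

Security
Link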
Evaluating Claude’s bioinformatics research capabilities with BioMysteryBench
misshiki 2026/05/01
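Anthropic evaluated Claude's bioinformatics capabilities with BioMysteryBench. Of 99 questions, 76 are tasks humans can solve and 23 are tasks an expert panel could not solve. Claude Mythos Preview solved up to 30% of the latter.

Anthropic

Artificial intelligence
Link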