本文「computer use agents github」を検索

1 - 40 件 / 108件

新着順人気順

絞り込み

検索対象
ブックマーク数
期間
セーフサーチ

computer use agents githubの検索結果1 - 40 件 / 108件

Devinを導入して1ヶ月経ったので、人間とAIとでどのような開発の役割分担をするべきか振り返ってみる - Generative Agents Tech Blog
- 208 users
- blog.generative-agents.co.jp
- テクノロジー
- 2025/02/23
こんにちは、ジェネラティブエージェンツの西見です。「完全自律型AIエンジニア」という触れ込みと、その印象的なティザー動画で一躍有名になったDevinが、2024年12月10日にGAしました。 www.cognition.ai それからしばらく経ったこともあって、X上でもチラホラと日本企業におけるDevin採用報告が聞こえてくるようになり、「こんなタスクには使えた😆」「簡単なタスクにハマり続けて使えない、金もったいない😭」といったポストがよく見られるようになりました。正直なところ、月500ドルは高いなぁ・・・*1なんて思っていたのですが、弊社も多分に漏れず猫の手も借りたい状況なのもあって、2025年1月22日からDevin（猫の手）を採用してみました。それからちょうど1ヶ月が経ったので、弊社の開発状況にどんな変化があったのかを振り返って、レポートしてみたいと思います。 GitHubア
- AI
- あとで読む
- Devin
- 開発
- 人工知能
- techfeed
- 機械学習
Claude 3.7 Sonnet and Claude Code
- 161 users
- www.anthropic.com
- テクノロジー
- 2025/02/25
Today, we’re announcing Claude 3.7 Sonnet1, our most intelligent model to date and the first hybrid reasoning model on the market. Claude 3.7 Sonnet can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. API users also have fine-grained control over how long the model can think for. Claude 3.7 Sonnet shows particularly strong improvements in coding
- AI
- あとで読む
- LLM
- claude
Introducing Claude Opus 4.5
- 143 users
- www.anthropic.com
- テクノロジー
- 2025/11/25
Our newest model, Claude Opus 4.5, is available today. It’s intelligent, efficient, and the best model in the world for coding, agents, and computer use. It’s also meaningfully better at everyday tasks like deep research and working with slides and spreadsheets. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done. Claude Opus 4.5 is state-of-
- Claude
- あとで読む
- AI
- Anthropic
- LLM
- 未分類
- 人工知能
Microsoft Build 2025の新発表まとめ【30選】
- 107 users
- zenn.dev/galirage
- テクノロジー
- 2025/05/21
はじめまして、ますみです！株式会社Galirage（ガリレージ）という「生成AIに特化して、システム開発・アドバイザリー支援・研修支援をしているIT企業」で、代表をしております^^ この記事では、Microsoft Build 2025の発表内容をまとめていきたいと思います🎉 もしも現地で参加している方は、ぜひ会場で見かけたらお声がけいただけたら嬉しいです^^ ちなみに、現地のKeynoteの会場の雰囲気はこんな感じでした！！！イントロダクションまず、CEOのサティア・ナデラさんは、Building the open agentic web という世界観を発表しました！このフレーズは、Build 2025の重要なテーマであり、この後の最新発表につながっています！さらに、以下のDeveloper tools と次の4段階のレイヤーに分類をして、これ以降の発表をしていきます。 A
- Microsoft
- あとで読む
- まとめ
- 人工知能
- techfeed
- AI
Code Interpreter API
- 90 users
- blog.langchain.com
- テクノロジー
- 2023/07/18
Editor's Note: This is another installation of our guest blog posts highlighting interesting and novel use cases. This blog is written by Shroominic who built an open source implementation of the ChatGPT Code Interpreter. Important Links: GitHub RepoIn the world of open-source software, there are always exciting developments. Today, I am thrilled to announce a new project that I have been working
- python
- あとで読む
- AI
- OpenAI
- コード
- プログラミング
- API
2024年生成AIエージェントのおすすめ論文 16選 - 襖からキリン
- 61 users
- masamasa59.hatenablog.com
- テクノロジー
- 2024/12/25
こんにちは！ AIエージェントに一年を捧げた太田（https://x.com/ottamm_190）です。年末のエージェント記事の第四弾です。第一弾→ Weekly AI Agent News!から見えたAIエージェントの現在地 - 襖からキリン第二弾→ AIエージェントビジネスの現状と今後の考察 - 襖からキリン第三弾→ 生成AIエージェントが刺さる業務課題を探そう！ - 襖からキリン今年のWeekly AI Agents News!を更新し続けて個人的に学びがあった論文を紹介します。特に研究者よりかはビジネス層やエンジニア層に読んで学びがありそうなのを満遍なく16本紹介します。キリ良く15本には削れなかったですね。はい。読者層は真ん中ぜひ、年末にお手元の生成AIを使って読んでみてください。質問例も載せておきます。（生成結果は確認していませんが、当時聞いたような記憶も
GitHub - modelcontextprotocol/servers: Model Context Protocol Servers
- 60 users
- github.com/modelcontextprotocol
- テクノロジー
- 2024/11/28
Official integrations are maintained by companies building production ready MCP servers for their platforms. 21st.dev Magic - Create crafted UI components inspired by the best 21st.dev design engineers. 2slides - An MCP server that provides tools to convert content into slides/PPT/presentation or generate slides/PPT/presentation with user intention. ActionKit by Paragon - Connect to 130+ SaaS inte
- MCP
- AI
- LLM
- Anthropic
- server
- protocol
- github
- プログラミング
GitHub - bregman-arie/devops-exercises: Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
- 58 users
- github.com/bregman-arie
- テクノロジー
- 2021/08/27
In general, what do you need in order to communicate? A common language (for the two ends to understand) A way to address who you want to communicate with A Connection (so the content of the communication can reach the recipients) What is TCP/IP? A set of protocols that define how two or more devices can communicate with each other. To learn more about TCP/IP, read here What is Ethernet? Ethernet
AI破産を防ぐために - LLM API利用におけるEconomic DoSのリスクと対策 - GMO Flatt Security Blog
- 55 users
- blog.flatt.tech
- テクノロジー
- 2025/05/21
はじめにこんにちは、GMO Flatt Security株式会社セキュリティエンジニアの松井（@ryotaromosao）です。近年、LLM（大規模言語モデル）が目覚ましい進化を遂げており、それを利用したLLMアプリケーションが急速に増加しています。特に、AIチャット機能やエージェント機能が既存のサービスに搭載されるのを目にする機会も多いと思います。しかしながら、LLM APIを用いたアプリケーションを提供する事業者にとって、「高額なAPIの利用料金を請求されたらどうしよう」という不安は大きいのではないでしょうか。私も自社開発のセキュリティ診断AIエージェントのTakumiを使って脆弱性診断やリサーチ活動をしていますが、そのLLM APIの利用料金にはいつもビクビクしています。まだ最適化が為されていなかった、Takumiの開発中の話ではありますが、脆弱性のリサーチ中に「このリポジ
- AI
- security
- LLM
- APIエコノミー
- 人工知能
- セキュリティ
- あとで読む
- techfeed
- API
The End of Programming – Communications of the ACM
- 47 users
- cacm.acm.org
- テクノロジー
- 2022/12/22
The end of classical computer science is coming, and most of us are dinosaurs waiting for the meteor to hit. I came of age in the 1980s, programming personal computers such as the Commodore VIC-20 and Apple ][e at home. Going on to study computer science (CS) in college and ultimately getting a Ph.D. at Berkeley, the bulk of my professional training was rooted in what I will call “classical” CS: p
- AI
- programming
- 機械学習
- Technology
- Society
- science
- computer
- プログラム
Build and deploy Remote Model Context Protocol (MCP) servers to Cloudflare
- 38 users
- blog.cloudflare.com
- テクノロジー
- 2025/03/26
Build and deploy Remote Model Context Protocol (MCP) servers to Cloudflare2025-03-25 It feels like almost everyone building AI applications and agents is talking about the Model Context Protocol (MCP), as well as building MCP servers that you install and run locally on your own computer. You can now build and deploy remote MCP servers to Cloudflare. We’ve added four things to Cloudflare that handl
- mcp
- cloudflare
- LLM
- AI
Claude Skills are awesome, maybe a bigger deal than MCP
- 32 users
- simonwillison.net
- テクノロジー
- 2025/10/17
Claude Skills are awesome, maybe a bigger deal than MCP 16th October 2025 Anthropic this morning introduced Claude Skills, a new pattern for making new abilities available to their models: Claude can now use Skills to improve how it performs specific tasks. Skills are folders that include instructions, scripts, and resources that Claude can load when needed. Claude will only access a skill when it
What We Learned from a Year of Building with LLMs (Part I)
- 32 users
- www.oreilly.com
- テクノロジー
- 2024/05/30
It’s an exciting time to build with large language models (LLMs). Over the past year, LLMs have become “good enough” for real-world applications. The pace of improvements in LLMs, coupled with a parade of demos on social media, will fuel an estimated $200B investment in AI by 2025. LLMs are also broadly accessible, allowing everyone, not just ML engineers and scientists, to build intelligence into
2025: The year in LLMs
- 30 users
- simonwillison.net
- テクノロジー
- 2026/01/01
31st December 2025 This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about AI in 2023 and Things we learned about LLMs in 2024. It’s been a year filled with a lot of different trends. The year of “reasoning” The year of agents The year of coding agents and Claude Code The year of LLMs on th
AIエージェント時代のWeb〜いま、第二のレスポンシブ設計が始まっている - Nothing ventured, nothing gained.
- 30 users
- takoratta.hatenablog.com
- テクノロジー
- 2026/04/28
ブラウザを開いて、AIエージェントに「最も静音なノイズキャンセリングイヤホンを探して、明日届くように手配しておいて」と頼む。エージェントは複数のECサイトを回り、レビューを比較し、カートに入れて配送指定をした上で、決済画面で「ここから先は確認をお願いします」と返してくる。このとき、ブラウザの向こう側で何が起きているのか。エージェントはピクセルを目で見ているのか、HTMLを解釈しているのか、それともサイト側が用意した「エージェント向けの入口」を使っているのか。 AIエージェント時代のWebがどう変わっていくのかは、私自身ずっと気になっていたテーマだった。最近腰を据えて調べてみたところ、思っていた以上に議論と実装が進んでいた。今回の記事では、私が学んだ範囲で、いまWebのアーキテクチャへの変更を促しつつある二つの標準技術──WebMCPとNLWeb──を、コードスニペットを含めて紹介していく
- AI
- あとで読む
- *
Things we learned about LLMs in 2024
- 28 users
- simonwillison.net
- テクノロジー
- 2025/01/01
31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a review of things we figured out about the field in the past twelve months, plus my attempt at identifying key themes and pivotal moments. This is a sequel to my review of 2023. In this article: The GPT-4 barrier was comprehensively broken Some of those GPT-4 models run on my laptop LLM pri
- LLM
- あとで読む
Open challenges in LLM research
- 24 users
- huyenchip.com
- テクノロジー
- 2023/08/17
[LinkedIn discussion, Twitter thread] Never before in my life had I seen so many smart people working on the same goal: making LLMs better. After talking to many people working in both industry and academia, I noticed the 10 major research directions that emerged. The first two directions, hallucinations and context learning, are probably the most talked about today. I’m the most excited about num
- research
- あとで読む
Wasm-agents: AI agents running in your browser
- 23 users
- blog.mozilla.ai
- テクノロジー
- 2025/07/04
One of the main barriers to a wider adoption and experimentation with open-source agents is the dependency on extra tools and frameworks that need to be installed before the agents can be run. In this post, we introduce the Wasm agents blueprint, aimed at showing how to write agents as HTML files, which can just be opened and run in a browser, without the need for any extra dependencies. This is s
- LLM
- AI
- browser
- 人工知能
- あとで読む
Why I stopped using AI code editors · Luciano Nooijen
- 21 users
- lucianonooijen.com
- テクノロジー
- 2025/04/02
TL;DR: I chose to make using AI a manual action, because I felt the slow loss of competence over time when I relied on it, and I recommend everyone to be cautious with making AI a key part of their workflow. In late 2022, I used AI tools for the first time, even before the first version of ChatGPT. In 2023, I started using AI-based tools in my development workflow. Initially, I was super impressed
Agents
- 21 users
- huyenchip.com
- テクノロジー
- 2025/01/09
Intelligent agents are considered by many to be the ultimate goal of AI. The classic book by Stuart Russell and Peter Norvig, Artificial Intelligence: A Modern Approach (Prentice Hall, 1995), defines the field of AI research as “the study and design of rational agents.” The unprecedented capabilities of foundation models have opened the door to agentic applications that were previously unimaginabl
- agent
- llm
- tool
AIエージェントビジネスの現状と今後の考察 - 襖からキリン
- 21 users
- masamasa59.hatenablog.com
- テクノロジー
- 2024/12/06
こんにちは！年末記事の第二弾、AIエージェントに関するビジネス記事になります。現状のエージェントはどうなっているのか、今後エージェントを始める方が参考になるように説明します。第一弾の記事は既に公開されています。 Weekly AI Agent News!から見えたAIエージェントの現在地 - 襖からキリン私が公開しているWeekly AI Agent News!や論文のリポジトリはこちらです。 speakerdeck.com github.com AIエージェントに取り組む人材とは？企業のAIエージェントの状況現状の主力エージェント製品を解説エージェントビルダーリサーチ、問い合わせ対応データに基づく意思決定支援様々なソースから資料作成 Agentic Process Automation これからのエージェントを考える生成AIエージェントと業務ソフトウェアの結びつきが強
From Coder to Orchestrator: The future of software engineering with AI - Human Who Codes
- 20 users
- humanwhocodes.com
- テクノロジー
- 2026/01/21
The software engineering industry is undergoing a major AI-driven transition in how we work. The days when humans needed to write every line of code are already behind us as LLMs become more capable and reliable. The improvement in code output during 2025 alone has been astounding. I’ve personally watched LLMs struggle with certain problems, then a few months later, solve them completely and effic
GitHub - gptme/gptme: Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
- 19 users
- github.com/gptme
- テクノロジー
- 2024/10/05
Coming soon - gptme.ai service for running agents in the cloud; gptme desktop app for easy local use. 2026-01 - gptme-agent-template v0.4: Bob reaches 1700+ autonomous sessions, autonomous run loops, enhanced context generation 2025-12 - v0.31.0: Background jobs, form tool, cost tracking, content-addressable storage 2025-11 - v0.30.0: Plugin system, context compression, subagent planner mode 2025-
- AI
- GitHub
- ツール
- あとで読む
GitHub - trycua/cua: Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
- 18 users
- github.com/trycua
- テクノロジー
- 2025/02/02
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
- mac
- VM
- Virtualization
- サーバ
- github
- AI
- linux
Opus 4.5 is going to change everything
- 17 users
- burkeholland.github.io
- テクノロジー
- 2026/01/07
Edit: A lot of folks have been asking what worfklows I used to write these apps. I used GitHub Copilot in VS Code with a custom agent prompt that you’ll find toward the end of this post. Context7 was the only MCP I used. I mostly just used the built-in voice dictation feature and talked to Claude. No fancy workflows, planning, etc required. The agent harness in VS Code for Opus 4.5 is so good - yo
- AI
- あとで読む
Ten Years, Starting Again: My Journey with TiDB
- 17 users
- medium.com/@siddontang
- テクノロジー
- 2025/10/19
The most precious things in life are memories and reflection. After we released the next generation TiDB Cloud, I think it is time for some reflection. Time flies — ten years have passed. On April 1, 2015, Max asked me, very seriously on April Fools’ Day, “Do you want to start a company together?” From that moment, I jumped on the TiDB train. The ride has been bumpy and brilliant. In these ten yea
- あとで読む
claude-cycles.dvi
- 16 users
- cs.stanford.edu/~knuth
- テクノロジー
- 2026/03/04
Claude’s Cycles Don Knuth, Stanford Computer Science Department (28 February 2026; revised 06 March 2026) Shock! Shock! I learned yesterday that an open problem I’d been working on for several weeks had just been solved by Claude Opus 4.6—Anthropic’s hybrid reasoning model that had been released three weeks earlier! It seems that I’ll have to revise my opinions about “generative AI” one of these d
The Death of the Stubborn Developer
- 16 users
- steve-yegge.medium.com
- テクノロジー
- 2025/02/10
The Death of the Stubborn Developer I wrote a blog post back in May called The Death of the Junior Developer. It made people mad. My thesis has since been corroborated by a bunch of big companies, and it is also happening in other industries, not just software. It is a real, actual problem, despite being quite inconvenient for almost everyone involved. My beehive-kicking post’s main premise is pre
Patterns for Building LLM-based Systems & Products
- 16 users
- eugeneyan.com
- テクノロジー
- 2023/08/02
Patterns for Building LLM-based Systems & Products [ llm engineering production 🔥 ] · 66 min read Discussions on HackerNews, Twitter, and LinkedIn “There is a large class of problems that are easy to imagine and build demos for, but extremely hard to make products out of. For example, self-driving: It’s easy to demo a car self-driving around a block, but making it into a product takes a decade.”
- LLM
- LLMOps
- text
- あとで読む
Claude Code is the Inflection Point
- 15 users
- newsletter.semianalysis.com
- テクノロジー
- 2026/02/06
4% of GitHub public commits are being authored by Claude Code right now. At the current trajectory, we believe that Claude Code will be 20%+ of all daily commits by the end of 2026. While you blinked, AI consumed all of software development. Our sister publication Fabricated Knowledge described software like linear TV during the rise of the internet and thinks that the rise of Claude Code is going
- 人工知能
- プログラミング
The economic potential of generative AI: The next productivity frontier
- 15 users
- www.mckinsey.com
- テクノロジー
- 2023/06/19
The economic potential of generative AI: The next productivity frontier Generative AI is poised to unleash the next wave of productivity. We take a first look at where business value could accrue and the potential impacts on the workforce. AI has permeated our lives incrementally, through everything from the tech powering our smartphones to autonomous-driving features on cars to the tools retailer
- report
- AI
- あとで読む
- technology
Microservices Are a Tax Your Startup Probably Can’t Afford
- 14 users
- nexo.sh
- テクノロジー
- 2025/05/09
Let’s unpack why microservices often backfire early on, where they genuinely help, and how to structure your startup’s systems for speed and survival. Monoliths Are Not the EnemyIf you’re building some SaaS product, even a simple SQL database wrapper eventually may bring a lot of internal complexity in the way your business logic works; additionally, you can get to various integrations and backgro
Letter to Arc members 2025
- 14 users
- browsercompany.substack.com
- テクノロジー
- 2025/05/27
Untitled (to a man, George McGovern) 2, Dan Flavin. Dia Beacon, 2024.Dear Arc members,You’re probably wondering what happened. One day we were all-in on Arc. Then, seemingly out of nowhere, we started building something new: Dia. From the outside, this pivot might look abrupt. Arc had real momentum. People loved it. But inside, the decision was slower and more deliberate than it may seem. So I wan
- Arc
- browser
- article
- AI
- あとで読む
Building agents with the Claude Agent SDK
- 13 users
- www.anthropic.com
- テクノロジー
- 2025/09/30
Published Sep 29, 2025 The Claude Agent SDK is a collection of tools that helps developers build powerful agents on top of Claude Code. In this article, we walk through how to get started and share our best practices. Last year, we shared lessons in building effective agents alongside our customers. Since then, we've released Claude Code, an agentic coding solution that we originally built to supp
Real-world gen AI use cases from the world's leading organizations | Google Cloud Blog
- 11 users
- cloud.google.com
- テクノロジー
- 2025/01/04
AI is here, AI is everywhere: Top companies, governments, researchers, and startups are already enhancing their work with Google's AI solutions. Published April 12, 2024; last updated October 9, 2025. Automotive & Logistics Business & Professional Services Financial Services Healthcare & Life Sciences Hospitality & Travel Manufacturing, Industrial & Electronics Media, Marketing & Gaming Public Sec
- ai
- dev
- google
- あとで読む
Agents have their own computers with Sandboxes GA
- 11 users
- blog.cloudflare.com
- テクノロジー
- 2026/04/14
When we launched Cloudflare Sandboxes last June, the premise was simple: AI agents need to develop and run code, and they need to do it somewhere safe. If an agent is acting like a developer, this means cloning repositories, building code in many languages, running development servers, etc. To do these things effectively, they will often need a full computer (and if they don’t, they can reach for
- セキュリティ
The Next Two Years of Software Engineering
- 10 users
- addyosmani.com
- テクノロジー
- 2026/01/08
January 5, 2026 The software industry sits at a strange inflection point. AI coding has evolved from autocomplete on steroids to agents that can autonomously execute development tasks. The economic boom that fueled tech’s hiring spree has given way to an efficiency mandate: companies now often favor profitability over growth, experienced hires over fresh graduates, and smaller teams armed with bet
- AI
- あとで読む
GitHub - bytedance/UI-TARS-desktop: The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
- 9 users
- github.com/bytedance
- テクノロジー
- 2025/01/23
[2025-11-05] 🎉 We're excited to announce the release of Agent TARS CLI v0.3.0! This version brings streaming support for multiple tools (shell commands, multi-file structured display), runtime settings with timing statistics for tool calls and deep thinking, Event Stream Viewer for data flow tracking and debugging. Additionally, it features exclusive support for AIO agent Sandbox as isolated all-
- agent
- language
- 言語
How Claude Code is built
- 9 users
- newsletter.pragmaticengineer.com
- テクノロジー
- 2025/09/24
Claude Code has taken the developer world by storm since being made generally available in May. The tool is currently generating more than $500M in annual run-rate revenue, and usage has exploded by more than 10x in the three months since that May release. I recently sat down with two of the founding engineers behind Claude Code: Boris Cherny (the engineer who came up with the original prototype,
Using Amazon Bedrock Agents to interactively generate infrastructure as code | Amazon Web Services
- 9 users
- aws.amazon.com
- テクノロジー
- 2024/07/12
AWS Machine Learning Blog Using Amazon Bedrock Agents to interactively generate infrastructure as code In the diverse toolkit available for deploying cloud infrastructure, Amazon Bedrock Agents offers a practical and innovative option for teams looking to enhance their infrastructure as code (IaC) processes. Amazon Bedrock Agents automates the prompt engineering and orchestration of user-requested
- AWS
- あとで読む