タイトル「speech」を検索 - はてなブックマーク

121 - 160 件 / 178件

新着順人気順

絞り込み

検索対象
ブックマーク数
期間
セーフサーチ

speechの検索結果121 - 160 件 / 178件

GitHub - rksm/org-ai: Emacs as your personal AI assistant. Use LLMs such as ChatGPT or LLaMA for text generation or DALL-E and Stable Diffusion for image generation. Also supports speech input / output.
- 3 users
- github.com/rksm
- テクノロジー
- 2023/03/12
Minor mode for Emacs org-mode that provides access to generative AI models. Currently supported are OpenAI API (ChatGPT, DALL-E, other text models), optionally run against Azure API instead of OpenAI Stable Diffusion through stable-diffusion-webui Inside an org-mode buffer you can use ChatGPT to generate text, having full control over system and user prompts (demo) Speech input and output! Talk wi
- org-mode
- ChatGPT
- emacs
- AI
AI Voice Generator: Versatile Text to Speech Software | Murf AI
- 3 users
- murf.ai
- テクノロジー
- 2023/02/07
Murf simplifies your business communication. Whether it’s voiceovers or translations, we provide solutions for every kind of project, making sure your message is clear, engaging, and far-reaching.
- AI
- webサービス
GitHub - ccoreilly/vosk-browser: A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
- 3 users
- github.com/ccoreilly
- テクノロジー
- 2022/05/19
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
Wav2vec: Semi-supervised and Unsupervised Speech Recognition
- 3 users
- vaclavkosar.com
- テクノロジー
- 2021/07/04
Word2vec for audio quantizes phonemes, transforms, GAN trains on text and audio from Facebook AI. JS disabled! Watch Wav2vec: Semi-supervised and Unsupervised Speech Recognition on Youtube Watch video "Wav2vec: Semi-supervised and Unsupervised Speech Recognition" Wav2vec is fascinating in that it combines several neural network architectures and methods: CNN, transformer, quantization, and GAN tra
Google may pull 'fediverse' Android apps for allegedly enabling hate speech (updated)
- 3 users
- www.engadget.com
- テクノロジー
- 2020/08/30
Google has stepped up its fight against hate speech, but there are concerns that it might be too aggressive. As Private Internet Access reports, Google has warned it will pull multiple “fediverse” apps (groups of interconnected servers used for web publishing) from the Play Store for allegedly inciting hate speech. Android titles like Fedilab, Husky, and Subway Tooter purportedly help users connec
Join the Premier Global Free Speech App | Parler
- 3 users
- parler.com
- 政治と経済
- 2021/01/07
People are on Parler. Join Parler to connect with others you may know. Parler is where free speech thrives.
Patrick Moelleken 🇺🇦 on Twitter: ".@ZelenskyyUa's tv address to the Russian (!) people might be the most moving speech that I've ever seen in my enti… https://t.co/dtLV1sMPFd"
- 3 users
- twitter.com/PMoelleken
- 暮らし
- 2022/02/25
.@ZelenskyyUa's tv address to the Russian (!) people might be the most moving speech that I've ever seen in my enti… https://t.co/dtLV1sMPFd
- 翻訳
- ウクライナ
- ロシア
- twitter
- 動画
GitHub - alexrudall/ruby-openai: OpenAI API + Ruby! 🤖❤️ Now with Assistants, Threads, Messages, Runs and Text to Speech 🍾
- 3 users
- github.com/alexrudall
- テクノロジー
- 2022/12/16
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
- Ruby
- あとで読む
Free TTS | Text to Speech Mp3 Free Online
- 3 users
- freetts.com
- 学び
- 2021/11/14
Free TTS: Text to Speech Mp3 Free Online Convert text to speech free online and download it as Mp3 in natural voices. 100% free for commercial use! Learn more about new FreeTTS and get 50% off coupon code. Read the post ↗. 0 / 5000 characters | Current Limit: 6000 characters per week. Characters Left: 6000. We support SSML TTS
- text
Voicemaker® - Text to Speech Converter
- 3 users
- voicemaker.in
- 暮らし
- 2021/01/12
SliderEnable slider on pause, pitch, speed textarea buttons.
The Moon Speech - John F. Kennedy at Rice University
- 3 users
- www.lizard-tail.com
- 政治と経済
- 2019/09/18
"The Moon Speech" John F. Kennedy at Rice University - September 12, 1962 Discription 1962年9月12日にライス大学のライス・スタジアムで行われたJFKの有名なスピーチです。この年、ライス大学はNASAに有人宇宙飛行センターのための広大な敷地を寄付しました。このスピーチはこれを記念したもので、ライス大学から名誉客員教授として招かれたケネディ大統領が、最初の講義を行うと言うスタイルで行われました。かの有名な "We choose go to the moon." のくだりは、スピーチのちょうど真ん中あたり、8分40秒頃から始まります。 Copyright : Rice University このあまりに有名なスピーチは、なぜこの時、この場所で行われなければならなかったんでしょうか？多くの名演説がそうであ
- 政治
『英国王のスピーチ（The King’s Speech）』映画の感想、レビュー、あらすじ、ネタバレ
- 3 users
- moneys-wines.com
- エンタメ
- 2019/10/14
『英国王のスピーチ（The King’s Speech）』とは？『英国王のスピーチ（The King’s Speech）』とは、2010年にギャガより配給、公開され、第83回アカデミー賞では主演男優賞、作品賞、監督賞、脚本賞を受賞しています。内容は、実在したイギリス王ジョージ6世と言語聴覚士であるライオネル・ローグ – ジェフリー・ラッシュとの二人の友情と、歴史を交えた伝記ドラマです。『イヴ・サンローラン（Yves Saint Laurent）』や『ベストセラー編集者パーキンズに捧ぐ』に似た男性同士の友情と、『マーガレット・サッチャー鉄の女の涙（ The Iron Lady）』のようなイギリスの歴史に関連した映画となっています。
Joaquin Phoenix's Oscars speech in full: 'We feel entitled to artificially inseminate a cow and steal her baby'
- 3 users
- www.theguardian.com
- 政治と経済
- 2020/02/10
This video has been removed. This could be because it launched early, our rights have expired, there was a legal issue, or for another reason.
[英語モチベーション] ロバートデニーロ演説 | 2015 TISCH graduation speech | 君達は終わってる| Robert De Niro |日本語字幕 | 英語字幕
- 3 users
- www.youtube.com
- エンタメ
- 2020/04/21
ロバートデニーロの2015年にニューヨーク大学芸術学部卒業講演の要約です。ロバートデニーロは卒業演説を「You are fxxked」から始めます。「皆さんは、終わってる」。。。温室のような学校の環境で育ってきた後輩達に、ジャングルのような現実世界に進むために必要なアドバイスを惜しみなく語っています。彼が経験し、最も重要だと考えているあなたの人生の知恵は何でしょうか？彼の話を聞いてみましょう。今日も楽しんでください。 ———————————————————————— Copyright Disclaimer Under Section 107 of the Copyright Act 1976, allowance is made for "fair use" for purposes such as criticism, commenting, news reportin
Web Speech Apiの紹介(ブラウザ上での音声認識の話題) - Qiita
- 3 users
- qiita.com/misoMac
- テクノロジー
- 2019/12/15
はじめにみなさんこんにちは。株式会社みんなのウェディングでデザイナーをしている私です。 12月もそろそろ折り返し地点ですね。くふうアドベントカレンダーも後半に差し掛かって来ましたね。今回はWeb Speech Apiの音声認識についての紹介です。詳しい実装方法などには触れてないです。こんなのもあるよ！程度です。 Web Speech Apiとは Web Speech API は、音声データをウェブアプリに組み入れることを可能にします。Web Speech API は、SpeechSynthesis (Text-to-Speech; 音声合成) と SpeechRecognition (Asynchronous Speech Recognition; 非同期音声認識) の 2 つの部分から成り立っています。音声認識の方は、リンク先で確認できる通り対応ブラウザがほぼ限られていますが、音
- JavaScript
Now you can transcribe speech with Google Translate
- 3 users
- blog.google
- テクノロジー
- 2020/03/18
The Queen gives coronavirus speech | nzherald.co.nz
- 3 users
- www.youtube.com
- 世の中
- 2020/04/06
Queen Elizabeth II: "We will succeed - and that success will belong to every one of us." Full story: https://bit.ly/2ywvE2Z Subscribe: https://goo.gl/LP45jX Check out our playlists: https://goo.gl/Swd249 Like NZ Herald on Facebook: https://goo.gl/tUC4oq Follow NZ Herald on Instagram: https://goo.gl/oLicXe Follow NZ Herald on Twitter: https://goo.gl/Wi6mbv
- 映像
- 医療
- Youtube
GitHub - r9y9/ttslearn: ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
- 3 users
- github.com/r9y9
- テクノロジー
- 2021/08/16
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
GoogleのCloud Speech-to-Textでリアルタイムに会話の文字起こし - ASKUL Engineering BLOG
- 3 users
- tech.askul.co.jp
- テクノロジー
- 2020/07/27
はじめに初めまして、4月からアスクルに新卒入社しました、「みわすけ」です。新卒エンジニアとして、まだまだ勉強中ではありますが、今回ヤフーさん主催の「Yahoo! JAPAN Internal Hack Day 17」というイベントに参加させていただきました。 HackDayとはテクノロジーを、もっと身近に、もっと楽しく。Hack Dayは、ものづくりの面白さを体験する祭典です。日本最大級のハッカソンや、注目のコンテンツを揃えた体験ブースなど、盛りだくさんのイベントを毎年開催しています。(https://hackday.jp より) その中で、我々アスクルチームは会議の議事録を取る行為をエンジニアリングで解決しようとなり、24時間で開発していきました。この記事ではその中で「発言を文字起こしする」部分に使用したGoogleのCloud Sppech-to-Textの使い方について解説しま
- hackathon
- yahoo
- python
Evidence of a predictive coding hierarchy in the human brain listening to speech - Nature Human Behaviour
- 3 users
- www.nature.com
- テクノロジー
- 2023/03/11
Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
- 機械学習
- 言語
【JS】Web Speech APIの音声読み上げ機能の実装方法（日本語 / 英語読み上げ）｜WEB CREATES
- 3 users
- web-creates.com
- テクノロジー
- 2023/01/20
WEBページで、WEB Speech APIを使ってWEBページのテキストを読み上げる実装方法を共有しようと思います。JavaScriptで実装をしました。今回は、個人開発で作成した英語学習サービスの英単語の発音を確認する際に、こちらのブラウザの音声合成機能を使用しました。英語を読み上げる際におこった不具合の対処も併せて紹介させていただきます。 Web Speech APIの音声読み上げ機能使ってみる【デモ】テキストを読み上げてみるどのように読み上げるのか確認できるデモサイトは下記になります。デモを見る <input type="text" id="text" name="text" value="吾輩は猫である。" placeholder="読み上げたいテキストを入力してください" /> <button onclick="readAloud()">読み上げる</button>
Shinzo Abe: Japan ex-leader assassinated while giving speech
- 3 users
- www.bbc.co.uk
- 政治と経済
- 2022/07/08
Japan's former prime minister Shinzo Abe has died in hospital after he was shot at a political campaign event. Abe was shot at twice while he was giving a speech on a street in the city of Nara on Friday morning.
Home AssistantでTTS(Text To Speech)を利用する。google-home-notifierの一歩先へ。 - Qiita
- 3 users
- qiita.com/odetarou
- テクノロジー
- 2021/02/08
Home AssistantでTTS(Text To Speech)を利用する。google-home-notifierの一歩先へ。TTSTextToSpeechHomeAssistantGooglehomenotifier Home AssistantにはTTS(Text To Speech)機能が標準で用意されています。任意のテキストをChromecastでGoogle Homeなどに喋らせることができます。ブラウザでダッシュボード画面を開いてテキストボックスに入力して喋らせたり、REST APIで呼ぶことも可能です。下記のようなことができます。 Home AssistantのText to Speech(TTS)でGoogle Home系にキャストできるのよさげ。 google-home-notifierはGoogle翻訳TTSを非公式に使ってて壊れやく微妙だったけどこれはC
- api
- google
フロント初心者がWeb Speech API を使ったシステムのフロント部分作ってみた！！ - Qiita
- 3 users
- qiita.com/yvngodowny
- テクノロジー
- 2023/07/04
はじめにインターンでフロントエンドを勉強し始めて1ヶ月の初心者が、「Web Speech API」を使って音声でChatGPTと連携できるWebサイトを作ってみました。今回はフロント部分についてです。システム概要 Webぺージでブラウザの音声認識機能を使ってChatGPTに話しかけ、音声合成機能を使って音声でChatGPTの返答を受け取ることができるシステムを作りました！システムのイメージはこんな感じ。PythonでChatGPTのAPIを叩く部分をバックエンドとしました。今回はフロントエンドについてなので、画面構成・音声認識・音声合成について書いていきます。音声認識・合成で使ったAPIについて今回は音声の認識・合成に「Web Speech API」を使いました。音声認識・合成については以下の記事を参考にさせてただきました。音声認識 , 音声合成 1.画面構成まずHTML
議事録担当なんてなくそうよ。Google Cloud Speech -to-Textを使ってみた
- 3 users
- techceed-inc.com
- テクノロジー
- 2020/05/18
はじめまして。イノベーション本部の田中です。ここ最近、お仕事では画像認識をやっておりますが、今回は音声認識のお話です。皆さん、議事録書くの面倒ではないですか？楽をしたいなーと思い、 Googleの音声認識(Cloud Speech-to-Text)を試してみたのでご紹介します。 Cloud Speech-to-Textについて機械学習を活用して音声をテキストに変換してくれる、GoogleのAPIサービスです。音声認識の精度が高く、多くの言語にも対応しているということで評判が良いAPIです。詳しくは、公式サイトをご確認ください。 (https://cloud.google.com/speech-to-text/?hl=ja) 取り組み内容今回試した内容は大きく2つです。 PCのマイクから認識した音声をリアルタイムでテキストに変換変換したテキストは、Googleスプレッドシ
Speech to Text - オーディオからテキストへの翻訳 | Microsoft Azure
- 3 users
- azure.microsoft.com
- テクノロジー
- 2020/02/26
Azure を探索 Azure について安全かつ将来を見据えた、オンプレミス、ハイブリッド、マルチクラウド、エッジのクラウドソリューションについて調べるグローバルインフラストラクチャ他のどのプロバイダーよりも多くのリージョンを備える持続可能で信頼できるクラウドインフラストラクチャについての詳細情報クラウドの経済性 Azure の財務上および技術的に重要なガイダンスを利用して、クラウドのビジネスケースを作成する顧客イネーブルメント実績のあるツール、ガイダンス、リソースを使用して、クラウド移行の明確なパスを計画するお客様事例成功を収めたあらゆる規模と業界の企業によるイノベーションの例を参照する
Open JTalk - HMM-based Text-to-Speech System
- 3 users
- open-jtalk.sp.nitech.ac.jp
- 学び
- 2022/03/26
サンプル「小さな鰻屋に，熱気のようなものがみなぎる．」 (声質: 0.55 ピッチシフト: 0 話速: 1.0) wav 「一週間ばかり，ニューヨークを取材した．」 (声質: 0.45 ピッチシフト: 18 話速: 1.2) wav オプション声質の値を小さくすると女性，大きくすると男性のような声になります．ピッチシフトの値を調整することで，合成する音声の高さを半音単位で変更します．話速の値を小さくすると遅く，大きくすると速くなります．合成テキスト最大200字までの文章を合成できます． 2018/07/11 利用規約の一部を緩和しました． 2012/12/25 [Ver. 1.8] Open JTalkのバージョンを1.06に更新しました．女性話者「Mei (Happy)」「Mei (Bashful)」「Mei (Angry)」「Mei (Sad)」を追加しました．音質を安
Revoicer - AI text to speech online - Emotion-based AI Voices Generator
- 3 users
- revoicer.com
- テクノロジー
- 2023/01/08
3,000+ People can not be wrong. Put AI to work in your marketing
p5.jsとp5.speechで音声合成と音声認識 : だらっと学習帳
- 3 users
- blog.livedoor.jp/reona396
- テクノロジー
- 2019/12/19
Processing Advent Calendar 2019 参加記事 p5.jsには実はたくさんのライブラリが存在します。コミッターの皆様には足を向けて寝られません。 libraries | p5.js 今回はその中からp5.speechを利用して遊んでみたいと思います！ p5.speech | Speech synthesis and recognition for p5.js p5.speechは音声認識と音声合成のためのライブラリで、これらの機能をサクッと簡単に使えるようなしくみになっています。しゃべらせよう音声合成のサンプルが以下のページで公開されています。 https://idmnyu.github.io/p5.js-speech/examples/02speechbox.html テキストボックスに適当な文言を入力して(日本語もいけます)、Speakボタンを押すと読み上
- library
Spike Lee Won an Oscar. Read His Passionate Speech. (Published 2019)
- 3 users
- www.nytimes.com
- 政治と経済
- 2020/06/17
Spike Lee accepting the award for best adapted screenplay for his film “BlacKkKlansman.”Credit...Noel West for The New York Times Spike Lee finally won his first competitive Oscar, and his acceptance speech was a doozy. At Sunday night’s Oscar ceremony, Lee won best adapted screenplay for “BlacKkKlansman” (sharing the award with Charlie Wachtel, David Rabinowitz and Kevin Willmott) and walked on s
SpeechBrain: A PyTorch Speech Toolkit
- 3 users
- speechbrain.github.io
- テクノロジー
- 2019/09/11
2023 Online SpeechBrain Summit Register for free and join us online on August 28th for our first SpeechBrain Online Summit endorsed by ISCA as an official Interspeech 2023 satellite event! In this one-day summit, you will learn about the latest developments and updates of SpeechBrain, and engage in an open and collaborative discussion with the community. The summit features four industrial talks f
- 機械学習
Reddit, Acting Against Hate Speech, Bans ‘The_Donald’ Subreddit (Published 2020)
- 3 users
- www.nytimes.com
- テクノロジー
- 2020/06/30
“The_Donald” subreddit, now banned by Reddit, is home to more than 790,000 users who post memes, viral videos and supportive messages about President Trump.Credit...Pete Marovich for The New York Times SAN FRANCISCO — Reddit, one of the largest social networking and message board websites, on Monday banned its biggest community devoted to President Trump as part of an overhaul of its hate speech p
- インターネット
東條英機演説 / Speech by Hideki Tojo
- 3 users
- www.youtube.com
- 学び
- 2020/07/18
大詔を拝し奉りて（たいしょうをはいしたてまつりて）とは、1941年12月8日、大東亜戦争の対米英宣戦の詔勅が渙発されたことを受けて同日午後7時過ぎ、内閣総理大臣・東條英機が日本国民にラジオ放送を通じておこなった決意表明である。日本が戦争せざるを得なかった理由が分かります。 BGM付きバージョンはコチラ https://www.youtube.com/watch?v=m-aRHtK0XWM
- 歴史
Data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
- 3 users
- ai.meta.com
- 暮らし
- 2022/01/22
Data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language 概要While the general idea of self-supervised learning is identical across modalities, the actual algorithms and objectives differ widely because they were developed with a single modality in mind. To get us closer to general self-supervised learning, we present data2vec, a framework that uses the same learning
まうり塩🍊 FREEDOM OF SPEECH!!!!! on Twitter: "今日は以前からずっと翻訳したかった動画を紹介。これは私には衝撃の動画だったんです、いろんな意味で。かなり聞き取りづらく、悪口ばかりだし、複数人が同時に喋るので、翻訳は「意訳」がかなりあるという事をご了承下さい。雰囲気掴んで頂け… https://t.co/2r7V2ns1K3"
- 3 users
- twitter.com/anaiscalico
- 学び
- 2020/09/30
今日は以前からずっと翻訳したかった動画を紹介。これは私には衝撃の動画だったんです、いろんな意味で。かなり聞き取りづらく、悪口ばかりだし、複数人が同時に喋るので、翻訳は「意訳」がかなりあるという事をご了承下さい。雰囲気掴んで頂け… https://t.co/2r7V2ns1K3
Modern CSS Tooltips And Speech Bubbles (Part 2) — Smashing Magazine
- 3 users
- www.smashingmagazine.com
- テクノロジー
- 2024/03/08
In Part 1 of this series, Temani Afif explored different CSS techniques to create tooltip shapes. The main challenge was to rely on a single element and create optimized code that could easily be controlled using CSS variables to update the size, shape, and position of the tail. In this second part, you are going explore more shapes. I hope you were able to spend time getting familiar with the tec
- css
Joe Biden speech: Watch Democrat presidential nominee Biden address the country | by Bat Man | Nov, 2020 | Medium
- 3 users
- mbat5047.medium.com
- 暮らし
- 2020/11/07
Joe Biden speech: Watch Democrat presidential nominee Biden address the country Election officials in Michigan have rejected the claim that a ballot for a 118-year-old man was ever received or counted in the state. The denial comes after claims went viral on social media on Thursday that an absentee ballot for a resident born in 1902 called William Bradley was counted in the Michigan election, sei
GitHub - argmaxinc/WhisperKit: Swift native on-device speech recognition with Whisper for Apple Silicon
- 3 users
- github.com/argmaxinc
- テクノロジー
- 2024/03/01
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
「桃太郎デスマッチ」ー Azure / AWS / GCP 学習済みAIサービスで「桃太郎」を Speech To Text してみた話 - Qiita
- 3 users
- qiita.com/Futo_Horio
- テクノロジー
- 2020/03/02
「桃太郎デスマッチ」ー Azure / AWS / GCP 学習済みAIサービスで「桃太郎」を Speech To Text してみた話AWSAzureCognitiveServicesSpeechToTextGoogleCloud はじめに 2019年1月23日(木) に Microsoft 主催の Ignite The Tour : Osaka にコミュニティ登壇させていただきました。本記事は、上記イベントで発表させていただいた LT ( ライトニングトーク ) の内容を記事にしたものです。 ※また、本記事では、3大クラウドプラットフォーム ( Azure / AWS / GCP ) の Speech To Text サービスの性能を比較し、ランク付けをさせていただいておりますが、使用する音声の録音環境、録音デバイス、その他環境の差により、当記事の検証結果と異なる場合がございます
- あとで読む
喋った情景をアニメ化するAI「Scribbling Speech」　デモ版はないけど、学習データや処理内容を解説した
- 3 users
- www.itmedia.co.jp
- テクノロジー
- 2022/09/22
Scribbling Speechは音声からアニメを作るので、文章から絵を描くAIとは入力データも出力結果も異なる。ただし自然言語で与えたデータを解析して、その内容を別の手法で表現するという動作は変わらない。Scribbling Speechをみていくと、Midjourneyの基本的な考え方を想像できるだろう。 Scribbling Speechは音声解析とアニメ化の二段階構成早速いつものようにScribbling Speechを体験しようとしたができなかった。というのも、実際に動かして試せるWebアプリケーションやサービスが公開されていないからだ。しかし興味深いAIなので、体験できずとも取り上げたい。ありがたいことに、Scribbling Speechを開発したXinyue Yang氏が、詳しい解説記事を公開していた。今回はこれを参考にしながら紹介する。音声入力を単語に分解　名詞と
- AI