タイトル「speech」を検索 - はてなブックマーク

161 - 178 件 / 178件

新着順人気順

絞り込み

検索対象
ブックマーク数
期間
セーフサーチ

speechの検索結果161 - 178 件 / 178件

Riva Speech AI SDK - Get Started
- 3 users
- developer.nvidia.com
- テクノロジー
- 2021/04/13
NVIDIA Riva NVIDIA® Riva is a set of GPU-accelerated multilingual speech and translation microservices for building fully customizable, real-time conversational AI pipelines. Riva includes automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) and is deployable in all clouds, in data centers, at the edge, or on embedded devices. With Riva, organizations can
Techno-Speech, Inc. / 株式会社テクノスピーチ
- 3 users
- www.techno-speech.com
- 暮らし
- 2021/01/15
最新のAI技術で人間の喋り方・歌い方をリアルに再現する音声創作ソフトウェアブランド「VoiSona」です。
Using AI to decode speech from brain activity
- 3 users
- ai.meta.com
- テクノロジー
- 2022/09/02
Every year, more than 69 million people around the world suffer traumatic brain injury, which leaves many of them unable to communicate through speech, typing, or gestures. These people’s lives could dramatically improve if researchers developed a technology to decode language directly from noninvasive brain recordings. Today, we’re sharing research that takes a step toward this goal. We’ve develo
2020年4月22日 FNNプライムニュース『ノーベル賞・本庶佑氏コロナ対策に緊急提言政府の対策で勝てるか + 韓国の良い所は見習わないと佐藤正久参議院議員韓国のコロナ対応を絶賛』 - 田中康夫 Speech To Text Online
- 3 users
- nippon2014be.hatenadiary.jp
- 世の中
- 2020/04/29
[佐藤正久]韓国は感染症に対する危機意識がかなり高いんです。MARSでの教訓もあるので、今回非常に感染症に対する感度、これが高い為に備蓄を含めて、あるいは態勢含めてやはり速いんです。そういう部分がやっぱり、今回我々としての、韓国の良い所は見習わないといけない。後で議論になるいろんな、PCRセンターを含めて韓国は一月からもうやってるんです。今、四月でしょ？で、もう三ヶ月の差があるんです。そのぐらい最初から危機感が高い。 * [竹内友佳]本庶さんは新型コロナウイルスとの戦いが今どういった状況にあるとご覧になっていますでしょうか。 [本庶佑]今仰ったこと、特に佐藤さんが仰ったことはその通りでね、韓国からは大変に見習うことが多いと思いますし、自衛隊、厚労省、こういったとこの連携とか、そういうことはやはりこういう場合にですね、政治家だけでなかなか判断できないから、やはり医療関係の専門家、基礎としてサ
大規模コーパスでGoogle Cloud Speech To Text APIの精度検証を行う & アップデート内容の検証 - OPTiM TECH BLOG
- 3 users
- tech-blog.optim.co.jp
- テクノロジー
- 2020/03/19
どうもこんにちは！新型コロナウイルスの影響で卒業式が中止になった、2020年新卒入社予定の山口です。今回はGoogle Cloud Speech-to-Text API（以下GST）を大規模コーパスで精度検証した結果と、GSTアップデートの検証内容について共有していけたらと思います。大規模コーパスでGSTの精度検証を行う JVS (Japanese versatile speech) corpusについて精度検証について認識精度の比較音量ごとによる精度の比較アップデート検証話者識別句読点２つを同時に試してみるとまとめ JVS (Japanese versatile speech) corpus ライセンス表記過去のGSTに関する記事もどうぞ tech-blog.optim.co.jp tech-blog.optim.co.jp 大規模コーパスでGSTの精度検証を行う
- あとで読む
GitHub - myshell-ai/MeloTTS: High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
- 3 users
- github.com/myshell-ai
- テクノロジー
- 2024/02/28
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
- japanese
- english
- ai
- github
A 2019 Guide to Speech Synthesis with Deep Learning - Fritz ai
- 3 users
- fritz.ai
- テクノロジー
- 2019/08/29
Artificial production of human speech is known as speech synthesis. This machine learning-based technique is applicable in text-to-speech, music generation, speech generation, speech-enabled devices, navigation systems, and accessibility for visually-impaired people. In this article, we’ll look at research and model architectures that have been written and developed to do just that using deep lear
米トヨタ、Googleの「Speech On-Device」採用--自然音声がネット接続不要に
- 3 users
- japan.cnet.com
- テクノロジー
- 2022/10/28
Googleは10月14日、トヨタ自動車とのパートナーシップを拡大すると発表した。トヨタと「LEXUS」の次世代オーディオマルチメディアシステムと、「Google Cloud」のAIベースの音声サービスを連携させる。 Google Cloudの新たなAI製品で、クラウド上で利用できるAIベースの音声認識、合成機能を組み込みデバイスに搭載する「Speech On-Device」を、今後提供するトヨタおよびLEXUS車に追加する。米国市場向けとなる2023年モデルの「トヨタカローラ」シリーズや、「タンドラ／セコイア／LEXUS NX／LEXUS RX」および、EVモデルの「LEXUS RZ」など、最新世代のトヨタオーディオマルチメディアとLEXUSインターフェースインフォテインメントシステムにおいて、車両が音声リクエストを直接処理し、音声クエリを実行できるようになる。自然音声機能のインター
GitHub - uezo/dify-voicevox-tts: VOICEVOX text-to-speech custom model for Dify
- 3 users
- github.com/uezo
- テクノロジー
- 2024/07/03
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
- Dify
Former Japanese Prime Minister Shinzo Abe assassinated during campaign speech, hospital officials confirm
- 3 users
- www.foxnews.com
- 政治と経済
- 2022/07/08
New Terms of UseNew Privacy PolicyYour Privacy ChoicesClosed Caption PolicyHelpContact UsAccessibility Statement This material may not be published, broadcast, rewritten, or redistributed. ©2024 FOX News Network, LLC. All rights reserved. Quotes displayed in real-time or delayed by at least 15 minutes. Market data provided by Factset. Powered and implemented by FactSet Digital Solutions. Legal Sta
wav2vec Unsupervised: Speech recognition without supervision
- 3 users
- ai.meta.com
- テクノロジー
- 2021/05/24
High-performance speech recognition with no supervision at all What the research is:Whether it’s giving directions, answering questions, or carrying out requests, speech recognition makes life easier in countless ways. But today the technology is available for only a small fraction of the thousands of languages spoken around the globe. This is because high-quality systems need to be trained with l
- 機械学習
- ai
GitHub - fishaudio/fish-speech: Brand new TTS solution
- 3 users
- github.com/fishaudio
- テクノロジー
- 2024/07/03
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
- github
AI Voice Generator with Text to Speech and Speech to Speech | Resemble AI
- 3 users
- www.resemble.ai
- テクノロジー
- 2020/01/08
Realtime text-to-speech to bring your game characters to life
Amazon ConnectとWatson Speech To Textの連携ー通話内容をテキスト化しメールで確認ー - Qiita
- 3 users
- qiita.com/K_Okumura
- テクノロジー
- 2019/09/26
はじめにコールセンターを新規に構築する際、Amazon Connectを使ってみようというニーズはあると思います。しかし通話内容のテキスト化や、テキスト化した内容の利用方法については、まだよく分からない部分も多いのではないでしょうか。例えば、COTOHAを使ったサービスとしては@azuki2iceさんの以下の記事があります。 Amazon Connectの通話をCOTOHAで音声認識させて通話テキストをSalesforceに自動登録する Amazon Connectは他サービスでも使えるので、今回はWatsonを連携し、通話内容を文字起こしして、その内容をメールで飛ばすサービスを作ってみたいと思います。本記事の内容を実装することで、以下の図のようなサービスが実現できます。なおこのサービスは構築にあたり、@Masaakiさんにご協力いただきました。ありがとうございます。構成 Am
Researcher Breaks reCAPTCHA With Google’s Speech-to-Text API
- 3 users
- threatpost.com
- テクノロジー
- 2021/01/05
Researcher uses an old unCAPTCHA trick against latest the audio version of reCAPTCHA, with a 97 percent success rate. An old attack method dating back to 2017 that uses voice-to-text to bypass CAPTCHA protections turns out to still work on Google’s latest reCAPTCHA v3. That’s according to researcher Nikolai Tschacher, who posted a video proof-of-concept (PoC) of the attack on Jan. 2. CAPTCHA, intr
Azure Cognitive ServicesのSpeech to Textで書き起こしをしてみよう - Qiita
- 3 users
- qiita.com/yamachu
- テクノロジー
- 2019/12/27
メリークリスマス！（遅刻） Azure AI Advent Calendar 2019 25日目のエントリーです。みなさんクリスマスイブからクリスマスにかけていかがお過ごしでしたか？私は本記事を書くために進捗の6時間を過ごして寝不足です。さて、今回はAzure Cognitive Servicesの中の一つである、Speech ServiceのSpeech to Textの使い方や使ってみた結果などを紹介していきます。実際に動かしてみたコードも載せるので、試してみたいけど書くの面倒だし…という方も安心してお読み下さい。音声変換 - Speech Service - Azure Cognitive Services | Microsoft Docs 用意するもの Azureのサブスクリプション .NET Core 3.0のアプリケーションがビルド出来る環境書き起こししたい音声始
Apple previews Live Speech, Personal Voice, and more new accessibility features
- 3 users
- www.apple.com
- テクノロジー
- 2023/05/17
CUPERTINO, CALIFORNIA Apple today previewed software features for cognitive, vision, hearing, and mobility accessibility, along with innovative tools for individuals who are nonspeaking or at risk of losing their ability to speak. These updates draw on advances in hardware and software, include on-device machine learning to ensure user privacy, and expand on Apple’s long-standing commitment to mak
WatsonのSpeech To Text(STT)をカスタマイズすれば「俺のSTT」を作れるよ - Qiita
- 3 users
- qiita.com/ishida330
- テクノロジー
- 2019/11/23
こんにちわ！石田です。たまたま仕事でSTTのカスタマイズの機会があって、Qiitaの記事をみたら「STT入門」的なものは多いけれど、カスタマイズの方法に具体的に言及しているものは割と少なかったので、いまさらながら記事にしました。「AKB関連の発言や用語だけは異常な高精度で認識するSTT」でも「ドリルすな/せんのかいの発話だけを認識できるSTT1」でも、皆様のビジネス要件と趣味ご嗜好にあわせた「俺の･私のSTT」を作ってみたらいかがでしょうか。（簡単ですよ）要は(TL;DR;) STTとは音声（オーディオ）を文字に変換するWatsonの「文字起こし」サービス素の（=IBM提供の）STTが知ってるのは一般的な日本語（一般的な辞書)だけ固有名詞(ex. 会社名・商品名)や業界/趣味/専門用語・独自のいい回しなどは「素の」STTでは認識できないが、簡単に教えることができる。これをモデルのカス