text-to-speechの人気記事 31件 - はてなブックマーク

1 - 31 件 / 31件

新着順人気順

絞り込み

検索対象
ブックマーク数
期間
セーフサーチ

text-to-speechの検索結果1 - 31 件 / 31件

タグ検索の該当結果が少ないため、タイトル検索結果を表示しています。

text-to-speechに関するエントリは31件あります。機械学習、 AI、音声などが関連タグです。人気エントリには『月ノ美兎さんの音声合成ツール(Text To Speech) を作ってみた - Qiita』などがあります。

月ノ美兎さんの音声合成ツール(Text To Speech) を作ってみた - Qiita
- 170 users
- qiita.com/K2_ML
- テクノロジー
- 2020/05/29
何をした？ Youtube上に公開されている動画の音声から、ディープラーニング技術を用いた音声合成ツールを構築しました。今回対象にしたのは、バーチャルユーチューバー・にじさんじの委員長こと月ノ美兎さん（Youtubeチャンネル）　です。 ※選出理由は、単純に私がYoutube上で一番推している方だからです。成果動画から抽出した音声と、音声を文章に起こしたテキストの組み合わせのデータセット約50分ぶんを教師データとして学習した結果 ※学習に必要なデータ量は最低でも1時間程度と言われているので、まだまだ足りていません… 月ノ美兎さんの音声合成ツールを作ってみた https://t.co/YVdWW9vREb via @YouTube — K2 (@K2ML2) May 29, 2020 発話内容が不明瞭な箇所がありますが、一応ご本人の声に近い音声を作成することができているかと思います
ElevenLabs: Free Text to Speech & AI Voice Generator | ElevenLabs
- 62 users
- elevenlabs.io
- テクノロジー
- 2023/01/13
Create the most realistic speech with our AI audio platformPioneering research in Text to Speech, AI Voice Generator, and more
- AI
- 音声合成
- voice
- audio
- 機械学習
- technology
- webservice
音声文字起こし技術で業務効率化: Google Text to Speech と OpenAI Whisper の活用 - STORES Product Blog
- 49 users
- product.st.inc
- テクノロジー
- 2023/03/17
こんにちは、CTO室技術基盤グループの id:hogelog です。 STORES Product Blog でも多くの文字起こし記事がありますが、社内重要会議の文字起こしなど STORES 社内には様々なところで音声の文字起こし業務が存在します。そんな文字起こし業務ですが完全に人力で実施するのは作業コストがかなり高いです。今日はそのような業務を効率化する音声文字起こし技術とその変遷について紹介します。 Google Text to Speech の活用以前論より動くもの.fmを支える技術〜Podcast初心者が使っているツール紹介〜 - STORES Product Blog でも紹介しましたが STORES 社内では Google Text to Speech が STORES 社内の様々な文字起こし業務に活用されてきました。 product.st.inc Google Text
- 音声
- あとで読む
- AI
- tech
- ツール
- 技術
- google
スクウェア・エニックスによる、リアルな「架空言語」音声の作り方。Text-to-speechの機械学習モデルで生成した没入感の高いボイスコンテンツ【CEDEC+KYUSHU 2022】｜ゲームメーカーズ
- 45 users
- gamemakers.jp
- テクノロジー
- 2023/02/14
3年振りのリアル開催となった福岡で例年行われるゲーム開発者向けのカンファレンス「CEDEC+KYUSHU 2022」が、2022年11月12日（土）に開催されました。スクウェア・エニックス AI部のAIリサーチャー森友亮氏が登壇し、『意味が分からないからこそ、リアル～「架空言語」音声合成による、没入感の高いボイス付きコンテンツの実現～』と題した講演が行われました。見慣れた母国語のテキストから聞いたことのない架空言語の音声を生成する手法について語られた本講演をレポートします。 TEXT / じく EDIT / 酒井理恵
- 音声
- AI
- 機械学習
- 言語
- あとで読む
- techfeed

合成音声を使ってboard（SaaS）のチュートリアル動画を制作した話（VOICEPEAKとGoogle Cloud Text-to-Speech） - ヴェルク - IT起業の記録
- 26 users
- tamukai.blog.velc.jp
- テクノロジー
- 2023/02/06
boardというSaaSのチュートリアル動画を合成音声を使って制作しているので、その話を書いていきます。個別相談会のデモとチュートリアル動画以前書いた board（SaaS）個別相談会の変遷の中で少し触れたのですが、2021年に、個別相談会の中でやっていたデモをベースに、チュートリアル動画を制作しました。個別相談会では、業務の流れに沿って基本的な操作を一通り説明していくデモを行っていたのですが、途中に質問が挟まることも多く、そうすると、全体で30〜40分ほどかかってしまうことも多くありました。個別相談会は1時間枠なので、そのうち40分をデモで使うのは、時間の使い方としてもったいないなという課題感がありました。また、弊社は営業など外向けに活動するメンバーがいないため個別相談会はすべて僕がやっており、個別相談会を開催できる回数にも限りがありました。一方で「お試しする前にとりあえずデ
Introducing speech-to-text, text-to-speech, and more for 1,100+ languages
- 20 users
- ai.meta.com
- テクノロジー
- 2023/05/23
Introducing speech-to-text, text-to-speech, and more for 1,100+ languages Equipping machines with the ability to recognize and produce speech can make information accessible to many more people, including those who rely entirely on voice to access information. However, producing good-quality machine learning models for these tasks requires large amounts of labeled data — in this case, many thousan
- Meta
- 機械学習
- 人工知能
- AI
GitHub - mozilla/TTS: :robot: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
- 17 users
- github.com/mozilla
- テクノロジー
- 2021/04/15
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
【個人開発】ChatGPT × Text-to-Speech（Google）で知育サービスを作ってみた
- 14 users
- zenn.dev/yutafujiwara
- テクノロジー
- 2024/02/27
概要久しぶりに個人開発をしました！娘が「コペル」という幼児教室に通っています。その幼児教室で「コペルギネス」というゲームがあります。ゲームの内容は下記のようなゲームになります。 50個の(食べ物・動物等)絵が描いてある表を見て、順に作ったお話を先生がお話してくれます。それを２回聞いて、覚えます。自分の手元にも同じカードが５０個あるのでそれを子供が１人で回答用の表に順番通りに並べて貼っていくと言うゲームです。50個のカードのうち何個同じ場所に置けていたかを制限時間内に競います。正解した数が多かった人の勝ちです。単語と単語にお話をつけ、繋がったストーリーでイメージすることで記憶しやすくする効果があるそうです。このコペルギネスの練習をするとき、あらかじめ物語を作っておかないと、同じ絵を登場させてしまったり、答えを覚えていなかったりと結構大変なので、それを自動化するWebアプリを作りま
- あとで読む
Fire・iPhoneでのKindle本のテキスト読み上げ機能(Text-to-Speech)の使い方 - Random Life Blog
- 12 users
- randamlife.hatenablog.com
- テクノロジー
- 2020/02/14
Kindle本のテキスト読み上げ機能(Text-to-Speech) みなさん、こんばんは。最近、紙の本ではなくKindle本を導入して読書に勤しんでいるsamadaです。 Kindleはアマゾンのタブレット(Kindle、Fireなど)の他、スマホやiPhoneのアプリからも読めて非常に便利です。使い始めて気付いたのですが、さらに便利なのはテキスト読み上げ機能(Text-to-Speech)なるものがある点です。小説などの文章を音声で読み上げてくれる機能です。これを使うことで、普通のKidle本がなんちゃってオーディオブックに変わります。今日は、Kindle本のテキスト読み上げ機能(Text-to-Speech)について紹介したいと思います。 Kindleのテキスト読み上げ機能(Text-to-Speech)のメリット・デメリットメリットデメリット対応するKindle本
OpenAI TTS（Text to Speech）を Node.js で試してみた
- 9 users
- hyper-text.org
- テクノロジー
- 2023/11/08
OpenAI TTS（Text to Speech）を Node.js で試してみた先日開催された OpenAI Dev Day で新たに発表された、テキストから音声を生成する OpenAI TTS (Text To Speech) API が面白そうだったので、早速ですが Node.js 環境で簡単に試してみました。先日開催された OpenAI Dev Day では大幅な機能追加に加え、いくつかの新しい API も発表されました。その中で、テキストから音声を生成する OpenAI TTS (Text To Speech) API が面白そうだったので、早速ですが簡単に試してみることに。 Text to speech の概要や、API のリファレンスは下記にあります。 Text to speech - OpenAI API Create speech - API Referenc
- JavaScript
Narakeet - Easily Create Voiceovers and Narrated Videos Using Realistic Text to Speech!
- 8 users
- www.narakeet.com
- エンタメ
- 2020/03/30
Easily Create Voiceovers Using Realistic Text to Speech Stop wasting time on recording your voice, editing out mistakes and synchronising picture with sound. Just type or upload your script, select one of our 700 voices, and get a professionally sounding audio or video in minutes. Try Narakeet realistic text to speech free, no need to register. Get Started Text to SpeechWord PDF EPUB… to Audio Sli
- markdown
- video
- text
- 素材
Google Colab で OpenAI API の Text-to-Speech を試す｜npaka
- 7 users
- note.com/npaka
- テクノロジー
- 2023/11/08
「Google Colab」で「OpenAI API」の「Text-to-Speech」を試したので、まとめました。前回 1. Text-to-Speech「Text-to-Speech」、テキストの読み上げを行うAPIです。6つの内蔵ボイスが付属しており、次の目的で使用できます。・書かれたブログ投稿のナレーション・複数言語の音声を生成・ストリーミングを使用したリアルタイムオーディオ出力 2. セットアップColabでのセットアップ手順は、次のとおりです。 (1) パッケージのインストール。 # パッケージのインストール !pip install openai(2) 環境変数の準備。以下のコードの <OpenAI_APIキー> にはOpenAIのサイトで取得できるAPIキーを指定します。(有料) import os os.environ["OPENAI_API_KEY"] = "
- API
- google
English Text-to-speech software | Ondoku
- 6 users
- ondoku3.com
- テクノロジー
- 2020/12/14
text-to-speech software When you enter text in the text box below, you will hear it in your favorite voice. You can not only listen to the read text on the spot but also download it as an audio file (.mp3).
Speechify: Text to Speech Reader & AI Voice Generator
- 6 users
- speechify.com
- テクノロジー
- 2022/03/07
CUT YOUR READING TIME IN HALF. LET SPEECHIFY READ TO YOU.
- webサービス
グーグルから音声読み上げ（Text to Speech）を使った無料オーディオブック｜Sangmin Ahn
- 5 users
- note.com/sangmin
- テクノロジー
- 2020/12/23
こんにちは、Choimirai School のサンミンです。 0 はじめに前から紹介している音声読み上げ機能（Text to Speech、TTS）ですが、さらに進化し続けています。無料でアクセスできる本を ①TTSと②WaveNetを利用し、オーディオブックとして提供しているケースも増えています。読み上げの精度もたんたん人間に近づいている感じです。個人的な感想では、Gulliver's Travelsは言われないと分からないレベル。今回の note では、Google Playでダウンロードできる本を何冊が紹介させていただきます。 1 フィクションThe Legend of Sleepy Hollow Dracula Gulliver’s Travels The Strange Case of Dr Jekyll and Mr Hyde Frankenstein The War
- あとで読む
GitHub - coqui-ai/TTS: 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
- 5 users
- github.com/coqui-ai
- テクノロジー
- 2022/04/12
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
GitHub - snakers4/silero-models: Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
- 5 users
- github.com/snakers4
- テクノロジー
- 2022/06/20
Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks. Enterprise-grade STT made refreshingly simple (seriously, see benchmarks). We provide quality comparable to Google's STT (and sometimes even better) and we are not Google. As a bonus: No Kaldi; No compilation; No 20-step instructions; Also we have published TTS models that satisfy the following criteria: One-line usage; A
GitHub - NVIDIA/NeMo: A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
- 5 users
- github.com/NVIDIA
- テクノロジー
- 2020/11/30
Large Language Models and Multimodal Models New Llama 3.1 Support (2024-07-23) The NeMo Framework now supports training and customizing the Llama 3.1 collection of LLMs from Meta. Accelerate your Generative AI Distributed Training Workloads with the NVIDIA NeMo Framework on Amazon EKS (2024-07-16) NVIDIA NeMo Framework now runs distributed training workloads on an Amazon Elastic Kubernetes Service
GitHub - NATSpeech/NATSpeech: A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
- 4 users
- github.com/NATSpeech
- テクノロジー
- 2022/02/17
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
- tech
- あとで読む
GitHub - myshell-ai/MeloTTS: High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
- 4 users
- github.com/myshell-ai
- テクノロジー
- 2024/02/28
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
- japanese
- english
GitHub - yl4579/StyleTTS2: StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
- 4 users
- github.com/yl4579
- テクノロジー
- 2023/11/20
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
AI Voice Generator with Text to Speech and Speech to Speech | Resemble AI
- 4 users
- www.resemble.ai
- テクノロジー
- 2020/01/08
Use our Generative Voice AI models that are indistinguishable from humans
GitHub - collabora/WhisperSpeech: An Open Source text-to-speech system built by inverting Whisper.
- 4 users
- github.com/collabora
- テクノロジー
- 2024/01/18
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
AI Voice Generator: Versatile Text to Speech Software | Murf AI
- 3 users
- murf.ai
- テクノロジー
- 2023/02/07
Murf simplifies your business communication. Whether it’s voiceovers or translations, we provide solutions for every kind of project, making sure your message is clear, engaging, and far-reaching.
- AI
- webサービス
Voicemaker® - Text to Speech Converter
- 3 users
- voicemaker.in
- 暮らし
- 2021/01/12
SliderEnable slider on pause, pitch, speed textarea buttons.
FreeTTS - Text to speech mp3 online free
- 3 users
- freetts.com
- 学び
- 2021/11/14
Text to Speech Text to speech mp3 in natural voices. Free for commercial.
GitHub - r9y9/ttslearn: ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
- 3 users
- github.com/r9y9
- テクノロジー
- 2021/08/16
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
Home AssistantでTTS(Text To Speech)を利用する。google-home-notifierの一歩先へ。 - Qiita
- 3 users
- qiita.com/odetarou
- テクノロジー
- 2021/02/08
Home AssistantでTTS(Text To Speech)を利用する。google-home-notifierの一歩先へ。TTSTextToSpeechHomeAssistantGooglehomenotifier Home AssistantにはTTS(Text To Speech)機能が標準で用意されています。任意のテキストをChromecastでGoogle Homeなどに喋らせることができます。ブラウザでダッシュボード画面を開いてテキストボックスに入力して喋らせたり、REST APIで呼ぶことも可能です。下記のようなことができます。 Home AssistantのText to Speech(TTS)でGoogle Home系にキャストできるのよさげ。 google-home-notifierはGoogle翻訳TTSを非公式に使ってて壊れやく微妙だったけどこれはC
- api
- google
Open JTalk - HMM-based Text-to-Speech System
- 3 users
- open-jtalk.sp.nitech.ac.jp
- 学び
- 2022/03/26
サンプル「小さな鰻屋に，熱気のようなものがみなぎる．」 (声質: 0.55 ピッチシフト: 0 話速: 1.0) wav 「一週間ばかり，ニューヨークを取材した．」 (声質: 0.45 ピッチシフト: 18 話速: 1.2) wav オプション声質の値を小さくすると女性，大きくすると男性のような声になります．ピッチシフトの値を調整することで，合成する音声の高さを半音単位で変更します．話速の値を小さくすると遅く，大きくすると速くなります．合成テキスト最大200字までの文章を合成できます． 2018/07/11 利用規約の一部を緩和しました． 2012/12/25 [Ver. 1.8] Open JTalkのバージョンを1.06に更新しました．女性話者「Mei (Happy)」「Mei (Bashful)」「Mei (Angry)」「Mei (Sad)」を追加しました．音質を安
Revoicer - AI text to speech online - Emotion-based AI Voices Generator
- 3 users
- revoicer.com
- テクノロジー
- 2023/01/08
15000+ People can not be wrong. Put AI to work in your marketing
GitHub - uezo/dify-voicevox-tts: VOICEVOX text-to-speech custom model for Dify
- 3 users
- github.com/uezo
- テクノロジー
- 2024/07/03
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
- Dify