[B! voice] efcl's Bookmarks

efcl id:efcl

efcl's bookmarks about voice (44)

GitHub - amicalhq/amical: 🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no keyboard needed. 🆓 Powered by open source models, works offline, fast and accurate.
efcl 2026/01/15
A voice-input desktop app. It uses Whisper for speech recognition, runs locally, and works offline. It has features such as context awareness based on the active app, custom hotkeys, and a floating widget

voice

software

IME
GitHub - cursorless-dev/cursorless: Don't let the cursor slow you down
efcl 2025/02/24
Talon-based software that pairs with a VSCode extension to control the editor cursor by voice.

voice

software

VSCode
GitHub - fishaudio/fish-speech: SOTA Open Source TTS
efcl 2024/09/14
A text-to-speech system that supports English, Chinese, Japanese, and Portuguese.

voice

MachineLearning
VS Code Speech - Visual Studio Marketplace
Speech extension for Visual Studio Code. The Speech extension for Visual Studio Code adds speech-to-text and text-to-speech capabilities to Visual Studio Code. No internet connection is required; the voice audio data is processed locally on your computer. For example, you can use this extension anywhere VS Code offers
efcl 2024/02/11
An extension that enables Hold To Speech for Copilot Chat in VSCode. Currently English only.

VSCode

voice
GitHub - thewh1teagle/vibe: Transcribe on your own!
efcl 2024/01/26
A speech transcription app built with Whisper + Tauri.

voice

software
BetterDictation.com
Type so fast, your boss will think there's 3 of you! BetterDictation is your personal scribe. You speak, and it will quickly and flawlessly transcribe into any app.
efcl 2024/01/25
A voice-input app built on Whisper. It supports push-to-talk.

voice

software
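Trying Voice Input with superwhisper
I have recently been using superwhisper, a macOS application for voice input built on whisper.cpp. In short, it uses ggerganov/whisper.cpp models to recognize speech and type the result as text. Features: Whisper's recognition accuracy is high; it keeps up even when you speak quite fast; some models recognize Japanese; it can translate spoken Japanese into English; it works offline; it is paid, with two plans (subscription and one-time purchase); the free trial lasts 15 minutes, after which the selectable models are restricted. The demo on the official site shows that it keeps up even with quite fast speech. For most people, speaking will probably be faster than typing. It is not that good at long passages, but for text of one or two lines it is quite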
efcl 2024/01/17
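About superwhisper, an application for writing text via speech recognition. It can take spoken Japanese and output English, its recognition accuracy is good, and it works offline, so it is handy for casual voice input.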

macOS

voice

software
AI Voice Changer: Use AI To Change Your Voice For Free
Transform your voice into another while preserving emotion, delivery, and nuance. Say it how you want and hear it delivered in a completely different voice, with full control over the performance. Capture whispers, laughs, accents, and subtle emotional cues.
efcl 2024/01/15
A speech-to-speech service that converts one voice recording into another voice.

webservice

voice
MurmurType - Best Mac Speech to Text App | Keeps Up With How You Think
efcl 2024/01/13
Voice input and translation using Whisper speech recognition.

software

voice
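Paralinguistic Recognition for Extracting the Diverse Information in Speech and Conveying It to Machines: What Intent/Attitude Recognition and Emotion Recognition Make Possible
Introduction: for machines to understand human speech, recognizing intent, attitude, and emotion is essential. In recent years, technology that automatically generates fluent, conversational text has emerged, raising expectations for machines that can hold everyday conversations with people. In chat with text as input and output, you have probably already seen machines respond fluently. Now consider people and machines conversing by voice. Speech contains a wide variety of vocal nuances (paralinguistic information) that text (linguistic information) cannot express. Even when the content is exactly the same once written down, a difference in tone of voice, for example, can make the conveyed meaning the exact opposite. To build machines that can converse smoothly with people by voice, technology that lets machines recognize the diverse paralinguistic information in speech is therefore indispensable. So in this article, among the technologies for recognizing paralinguistic information, in particular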
efcl 2024/01/13
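About how to get machines to recognize expressions in speech that cannot be transcribed, such as affirmation or questioning mid-utterance and other emotional cues (paralanguage). I suspect this is also affected quite a bit by things like dialect.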

voice

article
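Technology for Capturing All the Information That Speech Conveys
Introduction: when you hear "spoken language processing technology," you may imagine technology for transcribing speech into text (speech recognition) or for analyzing and interpreting the transcribed text by machine. But speech also contains information such as the speaker's tone of voice and breathing. Information like this, which cannot be written down as text, has received little attention in conventional spoken language processing. In real communication, we use gestures and facial expressions in addition to speech to exchange a wide variety of information, consciously or unconsciously*1. Whether vocal or non-vocal, this information that cannot be put into text is collectively called "nonverbal" information. Among nonverbal information, the information that a speaker consciously tries to convey to the listener using voice is called "paralinguistic" information. This article examines paralinguistic information. "Para" means "peripheral, supplementary
efcl 2024/01/13
"Among nonverbal information, the information that a speaker consciously tries to convey to the listener using voice is called 'paralinguistic' information" "'Para' means 'peripheral, supplementary.' Paralinguistic information is something that cannot itself be put into text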

voice

article
superwhisper
Write 3x faster, without lifting a finger. superwhisper: AI powered voice to text
efcl 2024/01/10
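An application built on whisper.cpp and similar tools that works offline and provides speech-recognition text input, copying to the clipboard, and translation into English. You can start input from anywhere with Cmd+Space and paste the result directly. Speaking in Japanese and into English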

mac

software

voice
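おーぷんAI文字起こし (Open AI Transcription) - Easy, safe, and free for anyone. High-accuracy transcription
Security you can rely on: audio files are never uploaded from your computer to anywhere outside it. You can use it with confidence even for highly confidential audio files.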
efcl 2023/12/14
Runs Whisper in wasm.

voice
Free AI Voice Generator & Voice Agents Platform | ElevenLabs
In the ancient land of Eldoria, where skies shimmered and forests, whispered secrets to the wind, lived a dragon named Zephyros. [sarcastically] Not the “burn it all down” kind... [giggles] but he was gentle, wise, with eyes like old stars. [whispers] Even the birds fell silent when he passed.
efcl 2023/02/26
A service that creates a synthesized voice from voice audio.

voice

webservice
On-premise Speech Recognition
efcl 2023/02/01
On-premises transcription at a fixed cost.

webservice

voice
GitHub - neonbjb/tortoise-tts: A multi-voice TTS system trained with an emphasis on quality
efcl 2023/01/02
Text-to-speech

english

voice
GitHub - ggml-org/whisper.cpp: Port of OpenAI's Whisper model in C/C++
Stable: v1.7.6 / Roadmap High-performance inference of OpenAI's Whisper automatic speech recognition (ASR) model: Plain C/C++ implementation without dependencies Apple Silicon first-class citizen - optimized via ARM NEON, Accelerate framework, Metal and Core ML AVX intrinsics support for x86 architectures VSX intrinsics support for POWER architectures Mixed F16 / F32 precision Integer quantization
efcl 2022/11/23
A C++ implementation of Whisper.

voice

translate
GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
efcl 2022/09/22
A tool that can transcribe and translate audio files. It is trained on a large-scale dataset and supports many languages, including Japanese.

voice

MachineLearning

translate
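青空朗読 (Aozora Rodoku) | Readings of books held in 青空文庫 (Aozora Bunko)
We have revised the pages of this site to support screen readers. We will keep improving the site so that visually impaired people can enjoy the readings using screen-reading software. We hope the readings bring everyone a restful, enriching time. There are likely still shortcomings. If anything is hard to use, please let us know here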
efcl 2022/09/03
Readings of works from Aozora Bunko.

voice

book
Voice Enabled Camera - Take selfies by voice command App - App Store
efcl 2022/07/03
A simple app that lets you trigger the camera shutter by voice.

iOS

voice