#voice-ai — TECH Dashboard

Entries page 1/1 · 5 total

Thu, May 7 1 entries

blog codex 1mo ago ·

openai-blog

OpenAI、APIに新音声モデルを追加し音声AIを強化 Advancing voice intelligence with new models in the API

重要度 Medium Medium priority 重要度 Medium · 技術記事 · OpenAI / Codex Medium priority · technical post · OpenAI / Codex 公開 5月7日 Published May 7

AI要約 OpenAIはAPI経由で利用できる新しい音声モデル群を発表し、音声AIの性能を向上させた。より自然な発話、低レイテンシ、堅牢な認識を実現し、開発者が音声エージェントや対話アプリを構築しやすくなる。

EN Explore new realtime voice models in the OpenAI API that can reason, translate, and transcribe speech, enabling more natural and intelligent voice experiences.

#openai #voice-ai #speech-to-text +3

openai.com →

fallback

Thu, Apr 16 1 entries

NEW blog gemini 2mo ago ·

google-deepmind

Gemini 3.1 Flash TTS、表現力豊かな次世代AI音声を実現 Gemini 3.1 Flash TTS: the next generation of expressive AI speech

重要度 Medium Medium priority 重要度 Medium · 技術記事 · Gemini / Gemma Medium priority · technical post · Gemini / Gemma 公開 4月16日 Published Apr 16

AI要約 Google DeepMindは表現力に優れた次世代の音声合成モデル「Gemini 3.1 Flash TTS」を発表した。自然なイントネーションや感情表現を備え、低レイテンシかつ多言語対応で、開発者向けにAPIを通じて提供される。

EN Our newest audio model introduces granular audio tags that give you precise control to direct AI speech for expressive audio generation.

#deepmind #google #text-to-speech +3

deepmind.google →

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

media fallback

Fri, Mar 27 1 entries

NEW blog gemini 2mo ago ·

google-deepmind

Gemini 3.1 Flash Live登場、音声AIをより自然で信頼性の高いものに Gemini 3.1 Flash Live: Making audio AI more natural and reliable

重要度 Medium Medium priority 重要度 Medium · 技術記事 · Gemini / Gemma Medium priority · technical post · Gemini / Gemma 公開 3月27日 Published Mar 27

AI要約 Google DeepMindは音声対話向けモデル「Gemini 3.1 Flash Live」を発表した。応答の自然さと信頼性を高め、リアルタイム音声AIの実用性を一段と引き上げるアップデートとなる。開発者はLive APIを通じて低遅延の音声体験を構築できる。

EN Our latest voice model has improved precision and lower latency to make voice interactions more fluid, natural and precise.

#deepmind #google #voice-ai +3

deepmind.google →

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

media fallback

Sat, Dec 13 1 entries

NEW blog gemini 6mo ago ·

google-deepmind

Google DeepMind、Gemini音声モデルを刷新し高品質な音声体験を実現 Improved Gemini audio models for powerful voice experiences

重要度 Medium Medium priority 重要度 Medium · 技術記事 · Gemini / Gemma Medium priority · technical post · Gemini / Gemma 公開 12月13日 Published Dec 13

AI要約 Google DeepMindはGemini APIとVertex AI向けに改良された音声モデルを発表した。新たなネイティブ音声対話、TTS、音声認識(ASR)機能を提供し、より自然で表現豊かな会話体験を可能にする。エンタープライズ向け開発者が音声エージェントなどを構築できる。

EN Improved Gemini audio models for powerful voice experiences

#deepmind #google #gemini-api +4

deepmind.google →

Improved Gemini audio models for powerful voice experiences

media fallback

Thu, Aug 28 1 entries

🔥 HOT blog codex 9mo ago ·

openai-blog

OpenAI、gpt-realtimeとRealtime APIの大幅アップデートを発表 Introducing gpt-realtime and Realtime API updates

重要度 High High priority 重要度 High · 技術記事 · OpenAI / Codex High priority · technical post · OpenAI / Codex 公開 8月28日 Published Aug 28

AI要約 OpenAIが本番向け音声合成モデルgpt-realtimeとRealtime API正式版を公開。リモートMCPサーバー、画像入力、SIP電話対応などを追加。

EN We’re releasing a more advanced speech-to-speech model and new API capabilities including MCP server support, image input, and SIP phone calling support.

#mcp-server #openai #realtime-api +6

openai.com →

fallback

#voice-ai 5 total

Entries page 1/1 · 5 total

OpenAI、APIに新音声モデルを追加し音声AIを強化 Advancing voice intelligence with new models in the API

Gemini 3.1 Flash TTS、表現力豊かな次世代AI音声を実現 Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Gemini 3.1 Flash Live登場、音声AIをより自然で信頼性の高いものに Gemini 3.1 Flash Live: Making audio AI more natural and reliable

Google DeepMind、Gemini音声モデルを刷新し高品質な音声体験を実現 Improved Gemini audio models for powerful voice experiences

OpenAI、gpt-realtimeとRealtime APIの大幅アップデートを発表 Introducing gpt-realtime and Realtime API updates