OpenAI Audio API - News Search

1 month ago

OpenAI launches new voice intelligence features in its API

The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...

25 days ago

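Knockout: OpenAI officially takes over human ears as the first audio model with GPT-5-level reasoning arrives

[Overview] Knockout! OpenAI releases GPT-Realtime-2, the first audio model with GPT-5-level reasoning, and officially takes over human ears. The keyboard, the last "firewall" between humans and machines, is disappearing for good. In the early hours of this morning, OpenAI stunned the world once again. This time it is not competing on text or on video; it wants to bring Samantha, the AI from the film "Her" that amazed so many people and left so many others disappointed, fully into reality. OpenAI officially announced the launch of GPT-Real ...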

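1 month ago

OpenAI's smartest AI voice model: GPT-Realtime-2 arrives with GPT-5-level reasoning

GPT-Realtime-2 is designed for real-time interaction and is the first voice model with GPT-5-level reasoning. While keeping the conversation natural and fluid, it can reason, call tools, and handle user interruptions or corrections mid-conversation. This means developers can build more sophisticated voice assistants that can carry out multi-step tasks.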

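1 month ago

AI gets a mouth: OpenAI releases three voice models in a row

Early yesterday morning, OpenAI released three audio models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. According to OpenAI's website, the new models let developers build real-time voice products that can "reason, translate, and transcribe" while the user is speaking. All three models are now available for developers to test. The focus of this update is how the three models divide up different scenarios. GPT-Realtime-2 targets real-time voice agent scenarios; it is Ope ...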

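Tencent (腾讯网)

OpenAI lets its models "speak," and take note: insulting the AI is expensive too

OpenAI CEO Sam Altman; image processed by AI. By Su Yang | Edited by Xu Qingyang ...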

Ars Technica

OpenAI’s API users get full access to the new o1 model

Application developers who access OpenAI through its long-running API will now have access to the company’s latest full o1 model, rather than the months-old o1-preview. The upgrade is one of a number ...

Geeky Gadgets

OpenAI Launches New Speech-to-Text AI Audio Models API for Developers

OpenAI has today introduced a suite of advanced audio models and tools through its API, designed to empower developers in creating sophisticated, voice-driven applications. These updates include ...

1 month ago

OpenAI unveils three audio models for real-time voice tasks

OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based software agents more conversational and capable of completing tasks in real time.

1 month ago

OpenAI’s latest API models bring live translation and transcription to voice apps

OpenAI has introduced a new set of voice AI models capable of real-time reasoning, translation, and transcription, allowing ...

U.S. News & World Report

OpenAI Unveils Three Audio Models for Real-Time Voice Tasks

May 7 (Reuters) - OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based software agents more conversational and capable of completing tasks in ...

Neowin

OpenAI announces next-generation audio models to power voice agents

In recent months, OpenAI has released several new tools, including Operator, Deep Research, Computer-Using Agents, and the Responses API, focusing on text-based agents. Today, OpenAI announced new ...
