OpenAI WebRTC 音频会话,现已支持文档上下文
作者在 2024 年 12 月基于 OpenAI 新推出的 WebRTC API 构建了音频会话工具的第一个版本。上个月,OpenAI 为该 API 引入了名为 GPT‑Realtime‑2 的新模型,号称是首个具备 GPT‑5 级推理能力的语音模型。由于该模型迟迟未在 ChatGPT iPhone 应用中上线,作者更新了原有工具,支持选择更优模型,并允许用户粘贴大量文档内容,以便在浏览器中通过语音对话的形式探讨任何感兴趣的信息。
作者在 2024 年 12 月基于 OpenAI 新推出的 WebRTC API 构建了音频会话工具的第一个版本。上个月,OpenAI 为该 API 引入了名为 GPT‑Realtime‑2 的新模型,号称是首个具备 GPT‑5 级推理能力的语音模型。由于该模型迟迟未在 ChatGPT iPhone 应用中上线,作者更新了原有工具,支持选择更优模型,并允许用户粘贴大量文档内容,以便在浏览器中通过语音对话的形式探讨任何感兴趣的信息。
Andrej Karpathy announces the release of Claude Fable 5, the same underlying model as Mythos but with added safeguards. He calls it a major step forward, particularly for long problem-solving sessions on difficult tasks, and describes it as state-of-the-art on nearly all benchmarks with exceptional performance in software engineering, research, and vision.
Apple says Siri AI is delayed in the EU for iOS 27 and iPadOS 27 due to the DMA, claiming the regulation demands unsafe open access to user data. The European Commission rejected Apple's proposed safety measures, leaving no timeline for release.
The U.S. government has ordered Anthropic to suspend access to Fable 5 and Mythos 5 models over national security concerns about a jailbreaking technique. Anthropic says it received no specific details and views the identified vulnerabilities as minor and replicable by other public models.
The US government ordered Anthropic to block foreign nationals from accessing its AI models. The author argues this shifts AI regulation from safety to nationalist control, treating technology as a weapon for Americans only, and warns Europe to build its own capabilities rather than rely on regulation alone.
A court has ruled Google can be held liable for AI-generated hallucinations produced by its systems, marking a significant legal precedent that could influence future cases and regulations in other jurisdictions.