Show HN: Free real-time voice translator
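This tool uses Chrome's native APIs to perform real-time speech translation in the browser, converting microphone audio into another language as it is spoken. It is intended for multilingual conversations.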
Anthropic has introduced a 1-million-token context window for its Claude Opus 4.6 and Sonnet 4.6 models. The company is offering the increased capacity at no additional charge to users.
Qwen3.6-27B is a new open-weight AI model that claims flagship-level coding performance while being significantly smaller than its predecessor. The 27-billion-parameter model outperforms the previous 397-billion-parameter Qwen3.5-397B-A17B on coding benchmarks. The author tested a quantized 16.8GB version locally and demonstrated its capabilities by generating SVG images from text prompts.
Microsoft released VibeVoice, an MIT-licensed speech-to-text model with built-in speaker diarization. A test on a MacBook Pro transcribed one hour of audio in about 9 minutes, using up to 61.5GB of RAM. The model outputs JSON with text, timestamps, and speaker IDs, but is limited to one hour per run.
Google Meet is rolling out a speech translation feature for mobile devices that translates spoken conversation between languages with a short delay, using a rough imitation of the original speaker's voice. Currently supporting English, Spanish, French, German, Portuguese, and Italian, the feature is still in early alpha and showed inconsistent results across different devices.
The Servo browser engine is now available as an embeddable library on crates.io. A CLI tool called servo-shot was created to take screenshots of webpages using the new crate. While compiling Servo to WebAssembly isn't feasible, a playground was built for experimenting with html5ever and markup5ever_rcdom crates in WebAssembly.