长短期记忆(1997)[pdf]
本文由 Hochreiter 和 Schmidhuber 于 1997 年发表,提出了长短期记忆(LSTM)网络,这是一种特殊的循环神经网络(RNN)架构。LSTM 通过引入记忆单元和门控机制(输入门、遗忘门、输出门),有效解决了传统 RNN 在处理长序列数据时面临的梯度消失和梯度爆炸问题,从而能够学习长期依赖关系。该论文是深度学习领域的奠基性工作之一,对自然语言处理、语音识别和时间序列预测等众多领域产生了深远影响。
本文由 Hochreiter 和 Schmidhuber 于 1997 年发表,提出了长短期记忆(LSTM)网络,这是一种特殊的循环神经网络(RNN)架构。LSTM 通过引入记忆单元和门控机制(输入门、遗忘门、输出门),有效解决了传统 RNN 在处理长序列数据时面临的梯度消失和梯度爆炸问题,从而能够学习长期依赖关系。该论文是深度学习领域的奠基性工作之一,对自然语言处理、语音识别和时间序列预测等众多领域产生了深远影响。
A state-designed worm from 2005 called Fast16 sat undetected on VirusTotal for nearly a decade. It intercepted executable files at the kernel level and silently altered floating-point calculations in high-precision engineering software like LS-DYNA, which was used in Iran's nuclear weapons research. Unlike Stuxnet, Fast16 received little public attention for over twenty years.
Paul Graham reports that Y Combinator startups now have over 75% of their code written by AI, a threshold crossed at least one to two years ago. This parallels a similar transformation at Google, where AI-written code went from 0% to 75% in about two years.
Scientists are increasingly concerned about the potential collapse of the Atlantic Meridional Overturning Circulation (AMOC), a critical ocean current system. Such a collapse could have severe consequences for North America and Europe.
A compromised version of the LiteLLM Python package (version 1.82.8) was briefly available on PyPI, capable of exfiltrating sensitive credentials like SSH keys and cloud secrets. The malicious package affected any project that depended on LiteLLM, though it was only available for about an hour before discovery.
A supply chain attack has compromised the popular npm axios HTTP client library with 300 million weekly downloads. Malicious versions install a remote access trojan, though some users may have avoided infection through version pinning or older installations. Security experts warn this is a live compromise affecting one of npm's most depended-on packages.