Tag
31 articles
NVIDIA's Gated DeltaNet-2 decouples erase and write operations in linear attention, outperforming models like Mamba-2 and KDA in long-context tasks.
This explainer explores the significance of NVIDIA's RTX 5090D V2 GPU, its role in AI computing, and the geopolitical implications of China's import ban.
NVIDIA has released Nemotron-Labs-Diffusion, a tri-mode language model that generates up to six times as many tokens per forward pass as Qwen3-8B, offering enhanced efficiency and versatility in AI text generation.
Learn how cloud computing and AI chip deals work, using the recent partnership between Lambda and Hudson River Trading as an example. Understand the basics of GPU access through cloud services.
Learn how NVIDIA's new 4-bit pretraining method allows AI models to be trained more efficiently, using less memory and power while maintaining high accuracy.
NVIDIA introduces SANA-WM, a 2.6-billion-parameter open-source world model capable of generating 60-second 720p videos with precise camera control on a single GPU.
President Donald Trump claimed he and Xi Jinping discussed AI guardrails during their Beijing meeting, but no agreement was signed. H200 chip deliveries to China remain stalled amid ongoing tech tensions.
NVIDIA has released cuda-oxide, an experimental Rust-to-CUDA compiler backend that compiles SIMT GPU kernels directly to PTX, streamlining GPU development for Rust developers.
NVIDIA CEO Jensen Huang urged Carnegie Mellon University’s Class of 2026 to embrace the AI revolution as a new industrial era, emphasizing the need for both innovation and safety.
Learn how NVIDIA's Star Elastic technology packs multiple AI models into one file, making AI more efficient and accessible. This new method trains models of different sizes together, saving time and resources while improving performance.
Learn how speculative decoding helps AI systems generate text faster without losing accuracy, using a fast guess-and-check method.
LG is exploring collaboration with NVIDIA on physical AI, data centers, and mobility, signaling a major shift in how AI integrates with physical systems.