Tag
5 articles
AI chip startup Rebellions raises $400 million at a $2.3 billion valuation, positioning itself as a challenger to Nvidia's dominance in AI inference hardware.
Learn how to deploy machine learning models on Nvidia's new Vera Rubin platform with dedicated Groq 3 LPX inference chips using Docker containers and ONNX export.
Meta has unveiled four new generations of custom AI chips aimed at reducing inference costs and decreasing reliance on external GPU suppliers like Nvidia and AMD.
Learn about SPCT (Self-Principled Critique Tuning), a new method developed by DeepSeek AI that improves the scalability of reward models during inference, making AI systems more efficient and cost-effective.
Toronto startup Taalas is pioneering hardwired AI chips that can process 17,000 tokens per second, challenging the dominance of programmable GPUs in AI inference.