Groq.ch topics around fast inference
This is not an official Groq website. It is a collection of news about AI and fast inference.
News and Discussion around AI, LLMs and inference
NVIDIA and Microsoft Reinvent Windows PCs for the Age of Per
NVIDIA and Microsoft have introduced the RTX Spark platform, designed to power advanced on-device AI agents in Windows PCs with up to 1 petaflop of AI compute and 128GB unified memory. MediaTek collaborated on the custom CPU design, enhancing power e - AI
Read more
Netflix wiz creates app to slash AI bills, then open sources
A Netflix engineer developed Headroom, an open-source app that significantly reduces AI token usage costs by compressing redundant data before feeding it into large language models. Headroom targets verbose and repetitive inputs like serv - AI
Read more
NVIDIA Vera Rubin Ramps Into Full Production to Power Agenti
NVIDIA announced that its Vera Rubin platform is now entering full production to support agentic AI factories globally. The new Spectrum-X Ethernet Photonics technology provides 5 times better power efficiency, 5 times longer AI uptime, and 1.3 times - AI
Read more
NVIDIA DGX Station for Windows Puts a Trillion-Parameter AI
NVIDIA's DGX Station for Windows delivers a trillion-parameter AI supercomputer directly to enterprise desktops. It enables creation of powerful AI agents tailored for 3D design and engineering tools. These agents act as intelligent assistants that u - AI
Read more
NVIDIA DSX Gives Infrastructure Builders the Playbook for AI
NVIDIA DSX provides a comprehensive platform for designing and operating AI factories, combining open source software, APIs, and NVIDIA's accelerated computing technology. It aims to streamline AI deployment, enhance operational reliability, and maxi - AI
Read more
Guess Who’s Got an AI Edge in a Tough Job Market?
The job market for recent graduates is beginning to look more welcoming after a period of difficulty. Companies are now better positioned to hire following headcount reductions and hiring freezes that resolved pandemic-era overstaffing. While artific - AI
Read more
Nvidia-backed $5 billion AI company tells CNBC of major Lond
Runway, a U.S. AI company backed by Nvidia and valued at $5 billion, plans a major expansion in London, where it will set up its new European headquarters. The company plans to invest more than $200 million into the U.K.'s AI ecosystem by the end of 2028. This strategic mov - AI
Read more