Groq.ch topics around fast inference

This is no official Groq website. This is a news collection around AI and fast inference.

Get started now

News and Discussion around AI, LLMs and inference

NVIDIA and Microsoft Reinvent Windows PCs for the Age of Per

NVIDIA and Microsoft have introduced the RTX Spark platform, designed to power advanced on-device AI agents in Windows PCs with up to 1 petaflop of AI compute and 128GB unified memory. MediaTek collaborated on the custom CPU design, enhancing power e - AI


Read more

Netflix wiz creates app to slash AI bills, then open sources

Netflix engineer Claude Sonnet developed Headroom, an open-source app that significantly reduces AI token usage costs by compressing redundant data before feeding it into large language models. Headroom targets verbose and repetitive inputs like serv - AI


Read more

NVIDIA Vera Rubin Ramps Into Full Production to Power Agenti

NVIDIA announced that its Vera Rubin platform is now entering full production to support agentic AI factories globally. The new Spectrum-X Ethernet Photonics technology provides 5 times better power efficiency, 5 times longer AI uptime, and 1.3 times - AI


Read more

NVIDIA DGX Station for Windows Puts a Trillion-Parameter AI

NVIDIA's DGX Station for Windows delivers a trillion-parameter AI supercomputer directly to enterprise desktops. It enables creation of powerful AI agents tailored for 3D design and engineering tools. These agents act as intelligent assistants that u - AI


Read more

NVIDIA DSX Gives Infrastructure Builders the Playbook for AI

NVIDIA DSX provides a comprehensive platform for designing and operating AI factories, combining open source software, APIs, and NVIDIA's accelerated computing technology. It aims to streamline AI deployment, enhance operational reliability, and maxi - AI


Read more

Guess Who’s Got an AI Edge in a Tough Job Market?

The job market for recent graduates is beginning to look more welcoming after a period of difficulty. Companies are now better positioned to hire following headcount reductions and hiring freezes that resolved pandemic-era overstaffing. While artific - AI


Read more

Nvidia-backed $5 billion AI company tells CNBC of major Lond

Runway, a U.S. AI company backed by Nvidia and valued at $5 billion, plans a major expansion in London. It will set up its new European headquarters in the city. The firm will invest more than $200 million into the U.K.'s AI ecosystem by the end of 2 - AI


Read more

Nvidia-backed $5 billion AI company tells CNBC of major Lond

Nvidia-backed AI company Runway, valued at $5 billion, announced a major expansion establishing London as its new European headquarters. The company plans to invest over $200 million into the U.K.'s AI ecosystem by the end of 2028. This strategic mov - AI


Read more