Artificial Intelligence, CPUs and Processors
Nvidia claims 10x cost savings with open-source inference models
Nvidia has released analysis showing a 4X to 10X reduction in cost per token for AI inferencing by switching to open source models. The cost reductions were achieved by pairing Nvidia’s Blackwell GPU platform with open-source models from Baseten, DeepInfra, Fireworks AI, and Together AI. Their tests showed significant cost improvements across healthcare, gaming, agentic chat, and customer service. […
Intel teams with SoftBank to develop new memory type
Intel announced a groundbreaking collaboration with SoftBank in September 2023, targeting the development of a new memory type designed to address escalating demands in AI-driven data centers. This partnership leverages Intel’s chip manufacturing expertise and SoftBank’s Arm architecture ecosystem to create hybrid memory modules that promise 2x faster data access speeds compared to traditional DRAM....