Where to point your RAG: Sourcing proprietary data for LLMs and AI agents
Analysis: There are many places within IT infrastructure that organizations can use to get a single proprietary data source for their large language models and AI agents doing RAG...
Storage news ticker – July 18
AWS said customers can boot Amazon Elastic Compute Cloud (Amazon EC2) instances on AWS Outposts using boot volumes backed by NetApp on-premises enterprise storage arrays and Pure Storage FlashArray,...
Liqid unveils composable GPU servers with CXL 2.0 memory pooling
Liqid has announced products enabling host server apps to access dynamically orchestrated GPU server systems built from pools of GPU, memory, and storage, focused on AI inferencing and agents.
Liqid...
Hammerspace pushes Open Flash Platform to rethink AI data storage
Interview: The Open Flash Platform (OFP) group aims to replace all-flash arrays with directly accessed flash cartridges that have a controller DPU, Linux and parallel NFS (pNFS) software, and...
Open Flash Platform group proposes SSD-free flash storage for AI era
An Open Flash Platform (OFP) group is aiming to rewrite flash storage standards for AI by getting rid of all-flash arrays and their controllers, replacing them with shelves of...
Quantum expands all-flash DXi line with higher-capacity backup boxes
Quantum has added two more DXi all-flash deduplicating backup storage products to its range, doubling and quadrupling the previous all-flash maximum capacity.
The DXi product line is built to store...
Cloudian plugs PyTorch into GPUDirect to juice AI training speeds
Cloudian engineers have added Nvidia GPUDirect support to a PyTorch connector to accelerate AI and machine learning workloads.
In the machine learning world, the Torch open source software library interfaces...
Meta superintelligence push to drive huge demand for storage
Meta is eyeing a massive AI datacenter expansion program with the chosen storage suppliers set for a bonanza.
CEO Mark Zuckerberg announced on Facebook: "We're going to invest hundreds of...
Quantum sheds senior staff as financial woes mount
Beleaguered storage vendor Quantum, which replaced CEO and chairman Jamie Lerner in June and CRO Henk Jan Spanjaard earlier this month, has made “a very large reduction” in staff,...
Kioxia UFS 4.1 flash promises smoother AI app performance
Kioxia is sampling the latest UFS v4.1 embedded smartphone and tablet flash with faster downloads and smoother app performance for on-device AI.
UFS (Universal Flash Storage) refers to small flash...
Graid going for Nvidia RAID gold
Graid occupies a technology niche with its Nvidia GPU-powered RAID cards and is powering ahead with a development roadmap featuring AI and high-performance computing (HPC) products.
The company, which says...
Nvidia extends LLM memory with tiered KV caching and Dynamo engine
Nvidia GPUs store vectors as key-value pairs in a large language model (LLM) memory cache – KV cache – which is tiered out in a multi-level structure ending with...