sur WEKA
WEKA Boosts Token Efficiency with NVIDIA BlueField-4 STX
WEKA has announced its integration with NVIDIA's STX reference architecture, enhancing token production by 6.5x within the same GPU footprint. This development slashes inference costs for AI-driven organizations. The integration of WEKA's NeuralMesh™ and Augmented Memory Grid™ technologies on NVIDIA STX provides a high-throughput memory storage crucial for efficient AI operations.
The shared key-value (KV) cache infrastructure solves the inference cost issue by maintaining context across agents and sessions, eliminating redundant computation. This integration is vital for scaling agentic systems, particularly in software engineering, where memory infrastructure frequently becomes a bottleneck.
AI innovators like Firmus are transforming inference economics using Augmented Memory Grid on NeuralMesh. The solution promises substantial improvements in responsiveness and scalability, maintaining high KV cache hit rates and avoiding performance degradation typical in DRAM-only architectures.
R. H.
Copyright © 2026 FinanzWire, tous droits de reproduction et de représentation réservés.
Clause de non responsabilité : bien que puisées aux meilleures sources, les informations et analyses diffusées par FinanzWire sont fournies à titre indicatif et ne constituent en aucune manière une incitation à prendre position sur les marchés financiers.
Cliquez ici pour consulter le communiqué de presse ayant servi de base à la rédaction de cette brève
Voir toutes les actualités de WEKA