site:venturebeat.com - Search News

IndexCache, a new sparse attention optimizer, delivers 1.82x faster inference on long-context AI models

Researchers at Tsinghua University and Z.ai built IndexCache to eliminate redundant computation in sparse attention models ...

Mistral AI just released a text-to-speech model it says beats ElevenLabs — and it's giving away the weights for free

Mistral AI launches Voxtral TTS, an open-weight enterprise voice model that runs on a smartphone and challenges ElevenLabs in ...

MuleRun launches self-evolving AI workforce, signaling a shift toward the ‘digital employee’ era

The current AI landscape is dominated by reactive models—tools that wait for a prompt before delivering an output. MuleRun is built on a different premise: anticipating certain tasks based on context.

Intercom's new post-trained Fin Apex 1.0 beats GPT-5.4 and Claude Sonnet 4.6 at customer service resolutions

Intercom plans to expand Fin beyond customer service into sales and marketing—positioning it as a direct competitor to ...

How xMemory cuts token costs and context bloat in AI agents

When standard RAG pipelines retrieve redundant conversational data, long-term AI agents lose coherence and burn tokens.

Oracle converges the AI data stack to give enterprise agents a single version of truth

Matt Kimball, vice president and principal analyst at Moor Insights and Strategy, told VentureBeat the data layer is where ...

Cloudflare’s new Dynamic Workers ditch containers to run AI agent code 100x faster

Cloudflare says dynamically loaded Workers are priced at $0.002 per unique Worker loaded per day, in addition to standard CPU ...

Ai2 releases MolmoWeb, an open-weight visual web agent with 30K human task trajectories and a full training stack

Ai2's MolmoWeb is the first open-weight visual web agent to ship with its full training dataset, giving enterprise teams the ...

The consequential AI work that actually moves the needle for enterprises

The most consequential AI work happening now is focused on the practical matters of governance, orchestration, and iteration.

Xiaomi stuns with new MiMo-V2-Pro LLM nearing GPT-5.2, Opus 4.6 performance at a fraction of the cost

MiMo-V2-Pro utilizes a 7:1 hybrid ratio (increased from 5:1 in the Flash version) to manage its massive 1M-token context ...

Helix Earth Secures $12 Million in Oversubscribed Seed 2 Funding to Revolutionize Energy Efficiency and Humidity Management in Commercial HVAC Systems

Helix Earth leverages hardware technology co-invented at NASA to disrupt commercial air conditioning, industrial filtration, and CO2 capture markets ...

Google's new TurboQuant algorithm speeds up AI memory 8x, cutting costs by 50% or more

Within 24 hours of the release, community members began porting the algorithm to popular local AI libraries like MLX for ...
