Deploying DFlash block diffusion on NVIDIA hardware accelerates autoregressive LLMs during latency-sensitive inference.
The open-source model combines a one-million-token context window with architectural updates aimed at lowering the cost of ...
Z.ai’s GLM-5.2 is an open-source model aimed at long-context coding-agent workflows, with support for a one-million-token ...
It allows engineering teams to host frontier-level AI on their own sovereign infrastructure, entirely eliminating vendor lock ...
In recent days, a new large language model from China has started circulating through technical circles with an unusual mix ...
Explore the Chinese open-source AI model challenging OpenAI and Anthropic with powerful coding abilities, agentic workflows, ...
Xiaomi MiMo-V2.5-Pro-UltraSpeed just hit 1,000 tokens per second, 15x faster than ChatGPT, on standard GPUs with no custom chips. Here's what Xiaomi MiMo is and why this speed record rewrites AI ...
Google's open-source diffusion language model generates 256 tokens in parallel and self-corrects, hitting 4x speed on one GPU ...
Allegro DVT, the leader in Semiconductor Video IPs and Video Compliance Tools, announces the availability of its real-time AV2 Decoder IP integrated into its Pulsar™ D400 Series Multi-Standard Decoding ...
A deal of that magnitude would dwarf the US$558 million that Zhipu raised in its Hong Kong IPO, when the shares were priced ...
Just when the AI industry’s attention seemed fixed on OpenAI, Google and Anthropic, a new Chinese model has stolen the ...