inference

MS unveils math-specific inference technology… “Outperforms o1 with an sLM”

Microsoft (MS) has announced a new technology, 'rStar-Math', that significantly improves the mathematical reasoning ability of small language models (sLMs). The technology reportedly improved sLMs' mathematical problem-solving ability dramatically and showed...

5 innovations and one challenge that ‘o3’ brought to AI

An evaluation has emerged that the inference model 'o3' released by OpenAI has significantly raised the level of existing artificial intelligence (AI) in five areas. However, its high cost remains an issue that...

“2nd-generation ‘Blackwell’ chip with 50% improved performance released in 6 months… Focus on inference”

Despite concerns that mass production of the Blackwell 'B200' GPU could be delayed longer than expected due to design issues, predictions have emerged that the release schedule of the 2nd-generation Blackwell 'B300' GPU will...

Hugging Face open-sources ‘Test-Time Scaling’, an inference technology for sLMs

Hugging Face has unveiled a technology that enhances the inference performance of open-source small language models (sLMs). Like OpenAI's 'o1', it is based on the 'Test-Time Compute' method, which improves response quality by...

The Best Inference APIs for Open LLMs to Enhance Your AI App

Imagine this: you have built an AI app around an incredible idea, but it struggles to deliver because running large language models (LLMs) feels like trying to host a concert with a cassette...

Combining Large and Small LLMs to Boost Inference Time and Quality

Implementing Speculative and Contrastive Decoding. Large language models are composed of billions of parameters (weights). For every word it generates, a model has to perform computationally expensive calculations across all of those parameters. Large language models...

Greg Brockman, President of OpenAI, “Focusing on the infrastructure business beyond software”

OpenAI President Greg Brockman participated in the SK 'AI Summit' keynote session on the 4th and confirmed OpenAI's entry into the 'infrastructure business' field, including manufacturing its own chips. President Brockman said, “Developing artificial general...

Using Objective Bayesian Inference to Interpret Election Polls

How to construct a polls-only objective Bayesian model that goes from a state polling result to a probability of winning the state. With the presidential election approaching, a question I, and I expect many...
