Source

DeepSeek releases open source ‘R1’, a fine-tuned version of ‘V3’… “Same performance as o1, 90% cheaper cost”

DeepSeek, which has been evaluated because the world's best open source model with its 'V3' model, has now released the 'R1' series, an inference model that competes with OpenAI's 'o1' model, as open source. DeepSeek...

“I’m GPT-4”…DeepSeek, the strongest open source model, learns to generate data with an open AI model

It is understood that the open source model 'DeepSeek-V3' released by China's DeepSeek introduced itself as ChatGPT. In other words, it may be assumed that the info generated by 'GPT-4' was learned for model...

Constructing Trust in LLM Answers: Highlighting Source Texts in PDFs

100% accuracy isn’t every little thing: helping users navigate the document is the actual valueSo, you're constructing a RAG system or using an LLM to speak with documents. But users often ask: how can...

DeepSeek launches the biggest LLM in open source history… “Caught up with GPT-4o”

China's DeepSeek has unveiled 'DeepSeek-V3', the biggest open source large language model (LLM) ever. It was emphasized that this model has performance that surpasses existing open source models similar to Meta's 'Rama 3.1 405B'...

Hugging Face, inference technology for SLM, ‘Test-Time Scaling’ open source released

Hugging Face has unveiled technology to enhance the inference performance of the open source Small Language Model (sLM). Like OpenAI's 'o1', it is predicated on the 'Test-Time Compute' method, which improves response quality by...

LG AI Research Institute releases open source for 3 ‘ExaOne 3.5’ models… “Additional on-device and frontier level”

LG AI Research Institute (President Bae Kyung-hoon) announced on the tenth that it had released three models based on 'EXAONE 3.5' as open source. That is an update conducted 4 months after the discharge of...

The day after tomorrow, the 102B open source model with ‘strongest Korean performance’ revealed… “Outperforms each GPT-4o and Q12”

MOREH (CEO Jo Kang-won), a specialist in artificial intelligence (AI) infrastructure solutions, has opened its self-developed Korean foundation large language model (LLM) 'Llama-3-Motif-102B' to Hugging Face. It was announced on the third that it...

Agent Ecosystems, Data Integration, Open Source LLMs, and Other November Must-Reads

Our latest cohort of recent authorsEvery month, we’re thrilled to see a fresh group of authors join TDS, each sharing their very own unique voice, knowledge, and experience with our community. Should you’re searching...

Recent posts

Popular categories

ASK ANA