leaderboard

Hugging Face, LLM leaderboard modified… “1st and 2nd place in the primary week are Q12 and Rama3”

HuggingFace has released a brand new open source Large Language Model (LLM) leaderboard following recent criticisms concerning the effectiveness of benchmarks. The primary rankings show strong performances from Chinese models. Tom's Hardware reported that in...

The AI model leaderboard

Welcome, AI enthusiasts.The AI world’s favorite open LLM scoreboard just got a serious upgrade, and Alibaba’s Qwen 2 is on top of the rostrum (for now). Hugging Face’s recent benchmarks are set to alter...

Upstage-NIA adds reasoning and arithmetic reasoning indicators to Korean LLM leaderboard

Upstage (CEO Kim Seong-hoon) and the Korea Intelligence and Information Society Agency (NIA, Director Hwang Jong-seong) announced on the eleventh that they will probably be upgrading the jointly operated 'Open Ko-LLM Leaderboard' by adding...

Yanolja and EdenTS ​​receive the ‘Korean LLM of the 12 months’ excellent model award

Yanolja and EdenTS ​​were chosen as models of the yr within the 'Ko-LLM Leaderboard', a Korean model performance evaluation. The Korea Intelligence Information Society Agency (NIA, President Hwang Jong-seong) and Upstage (CEO Kim Seong-hoon) announced...

W&B unveils Korean LLM leaderboard 'Tiger'… “Korea is an important market in Asia”

Weight & Bias (W&B), an American AI developer platform, announced on the 2nd that it has began operating the 'Horangi Korean LLM Leaderboard', which discloses the rating of the Korean language performance evaluation results...

Deepnoid ranks 1st in 'Open Ko-LLM Leaderboard'… Average increase by 1 point in 1 week

Deepnoid (CEO Choi Woo-sik), Korea's first-generation medical artificial intelligence (AI) specialist, won the 'Open Ko-LLM Leaderboard' hosted by the Korea Intelligence and Information Society Agency (NIA) and Upstage. It was announced on the fifteenth...

[2ì›” 2주차] Eden T&S, a specialist in RPA, advanced to second place…”Successful conversion of information and AI”

Eden T&S (CEO Kim Yeon-gi) has emerged as a frontrunner within the Korean Large Language Model (LLM) rating. Eden T&S's model 'DataVortexS-10.7B-dpo-v1.0' ranked 2nd within the 2nd week of February within the 'Open Ko-LLM Leaderboard'...

Hugging Face Leaderboard's first model to surpass 80 points on average appears… “Rapid development of open source”

For the primary time, a model with a mean rating of over 80 has appeared on the Hugging Face open source large language model (LLM) leaderboard. As well as, the open source camp...

Recent posts

Popular categories

ASK DUKE