Supervised Fine-Tuning

How OpenAI’s o3, Grok 3, DeepSeek R1, Gemini 2.0, and Claude 3.7 Differ in Their Reasoning Approaches

Large language models (LLMs) are rapidly evolving from simple text prediction systems into advanced reasoning engines capable of tackling complex challenges. Initially designed to predict the next word in a sentence, these models have...

Unraveling Large Language Model Hallucinations

Introduction In a YouTube video, Andrej Karpathy, former Senior Director of AI at Tesla, discusses the psychology of Large Language Models (LLMs) as emergent cognitive effects of the training pipeline. This text is inspired by his...

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Current long-context large language models (LLMs) can process inputs of up to 100,000 tokens, yet they struggle to generate outputs exceeding even a modest length of 2,000 words. Controlled experiments reveal that the model’s...

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Owing to its robust performance and broad applicability compared to other methods, LoRA, or Low-Rank Adaptation, is one of the most popular PEFT (Parameter-Efficient Fine-Tuning) methods for fine-tuning a large language...
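For readers unfamiliar with the low-rank update that LoRA applies (and that MoRA revisits with high-rank updating), the snippet below is a minimal PyTorch sketch of the core idea: the pretrained weight is frozen and a trainable product of two small matrices is added on top. The class name, rank, and scaling values are illustrative assumptions, not code from the article or from any specific library.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update: W x + (alpha/r) * B A x."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # the pretrained weights stay frozen
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)  # down-projection
        self.B = nn.Parameter(torch.zeros(base.out_features, r))        # up-projection, zero-init
        self.scale = alpha / r

    def forward(self, x):
        # Pretrained path plus the low-rank correction; only A and B receive gradients.
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

# Example: wrap a 768x768 projection layer and run a forward pass.
layer = LoRALinear(nn.Linear(768, 768), r=8, alpha=16)
out = layer(torch.randn(2, 768))
print(out.shape)  # torch.Size([2, 768])
```

Because only A and B are trainable, the number of updated parameters per layer drops from in_features × out_features to r × (in_features + out_features); MoRA's argument is that this rank-r bottleneck limits what the update can learn, which motivates its high-rank alternative.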

RAFT – A Fine-Tuning and RAG Approach to Domain-Specific Question Answering

As the applications of large language models expand into specialized domains, the need for efficient and effective adaptation techniques becomes increasingly crucial. Enter RAFT (Retrieval Augmented Fine-Tuning), a novel approach that combines...
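To make the RAFT idea concrete, here is a small, hedged sketch of how one might assemble a training example that mixes the relevant ("oracle") document with distractor documents, so the fine-tuned model learns to answer from the right evidence. The function name, prompt layout, and field names are assumptions for illustration only, not the format used in the RAFT paper or article.

```python
import json
import random

def build_raft_example(question, oracle_doc, distractor_docs, answer, k_distractors=3):
    """Assemble one RAFT-style training example: the question is paired with the
    oracle document plus sampled distractors, and the target answer should be
    grounded in the oracle document only."""
    context = [oracle_doc] + random.sample(distractor_docs, k_distractors)
    random.shuffle(context)  # shuffle so the model cannot rely on document position
    prompt = "\n\n".join(f"[Document {i + 1}]\n{doc}" for i, doc in enumerate(context))
    return {
        "prompt": f"{prompt}\n\nQuestion: {question}",
        "completion": answer,  # ideally a reasoned answer that cites the oracle document
    }

example = build_raft_example(
    question="What is the maximum input length of the model?",
    oracle_doc="The model supports inputs of up to 100,000 tokens.",
    distractor_docs=[
        "A document about tokenizer vocabularies.",
        "A document about API pricing tiers.",
        "A document about deployment hardware.",
        "A document about safety filtering.",
    ],
    answer="The model supports inputs of up to 100,000 tokens.",
)
print(json.dumps(example, indent=2))
```

In the published RAFT recipe, a fraction of examples also omit the oracle document entirely so the model learns when retrieval has failed; that detail is left out of this sketch for brevity.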
