
Decoding Strategies in Large Language Models

Contents: Background, Greedy Search, Beam Search, Top-k Sampling, Nucleus Sampling, Conclusion

The tokenizer, Byte-Pair Encoding in this case, translates each token in the input text into a corresponding token ID. GPT-2 then takes these token IDs as input and tries to predict the next token.
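As a minimal sketch of this step, the snippet below uses the Hugging Face transformers library to load GPT-2 and its BPE tokenizer, encode a prompt into token IDs, and read off the model's most likely next token. The prompt string is only an illustrative placeholder.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Load GPT-2 and its Byte-Pair Encoding tokenizer
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

# Illustrative prompt; the tokenizer maps each token in the text to a token ID
text = "I have a dream"
input_ids = tokenizer.encode(text, return_tensors="pt")

with torch.no_grad():
    outputs = model(input_ids)

# The logits at the last position score every vocabulary entry as the next token
next_token_logits = outputs.logits[0, -1, :]
next_token_id = torch.argmax(next_token_logits).item()
print(tokenizer.decode([next_token_id]))
```

Greedy search corresponds to always taking this argmax; the other strategies below differ only in how they pick from these same logits.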
