Scaling

Scaling Vector Search: Comparing Quantization and Matryoshka Embeddings for 80% Cost Reduction

is on the core of AI infrastructure, powering multiple AI features from Retrieval-Augmented Generation (RAG) to agentic skills and long-term memory. Consequently, the demand for indexing large datasets is growing rapidly. For engineering...

Scaling ML Inference on Databricks: Liquid or Partitioned? Salted or Not?

Introduction a continuous variable for 4 different products. The machine learning pipeline was in-built Databricks and there are two major components.  Feature preparation in SQL with serverless compute. Inference on an ensemble of several hundred models using...

Scaling Feature Engineering Pipelines with Feast and Ray

project involving the construct of propensity models to predict customers’ prospective purchases, I encountered feature engineering issues that I had seen quite a few times before. These challenges might be broadly classified into two categories: 1)...

Scaling innovation in manufacturing with AI

“AI-powered digital twins mark a significant evolution in the longer term of producing, enabling real-time visualization of the whole production line, not only individual machines,” says Indranil Sircar, global chief technology officer...

Scaling Recommender Transformers to a Billion Parameters

! My name is Kirill Khrylchenko, and I lead the RecSys R&D team at Yandex. One in all our goals is to develop transformer technologies inside the context of recommender systems, an objective we’ve...

How you can construct AI scaling laws for efficient LLM training and budget maximization

When researchers are constructing large language models (LLMs), they aim to maximise...

Anthropic’s AI “vaccines,” scaling gamble, and why it shut OpenAI out

Good morning. It’s Monday, August 4th.On at the present time in tech history: In 1987the legendary Connection Machine CM-2 from Pondering Machines Corporation finally landed in research labs. With its 65,536 processors, it...

R.E.D.: Scaling Text Classification with Expert Delegation

With the brand new age of problem-solving augmented by Large Language Models (LLMs), only a handful of problems remain which have subpar solutions. Most classification problems (at a PoC level) will be solved by...

Recent posts

Popular categories

ASK ANA