optimization

A Generalizable MARL-LP Approach for Scheduling in Logistics

Introduction that always operates with surprising inefficiency: manual processes, piles of paperwork, legal complexities. Many corporations still run on paper or Excel and don’t even collect data on their shipments. But what if an organization...

Decisioning on the Edge: Policy Matching at Scale

This text was written in collaboration with César Ortega, whose insights and discussions helped shape the ideas presented here. the best data product starts with sitting down with business partners to know day-to-day workflows,...

Optimizing Token Generation in PyTorch Decoder Models

which have pervaded nearly every facet of our day by day lives are autoregressive decoder models. These models apply compute-heavy kernel operations to churn out tokens one after the other in a way...

Iron Triangles: Powerful Tools for Analyzing Trade-Offs in AI Product Development

and operating AI products involves making trade-offs. For instance, a higher-quality product may take more time and resources to construct, while complex inference calls could also be slower and costlier. These trade-offs are...

Distributed Reinforcement Learning for Scalable High-Performance Policy Optimization

on Real-World Problems is Hard Reinforcement learning looks straightforward in controlled settings: well-defined states, dense rewards, stationary dynamics, unlimited simulation. Most benchmark results are produced under those assumptions. Observations are partial and noisy, rewards...

TDS Newsletter: Beyond Prompt Engineering: The Latest Frontiers of LLM Optimization

Never miss a brand new edition of , our weekly newsletter featuring a top-notch number of editors’ picks, deep dives, community news, and more. Most of the issues practitioners encountered when LLMs first burst onto the...

Automatic Prompt Optimization for Multimodal Vision Agents: A Self-Driving Automobile Example

Optimizing Multimodal Agents Multimodal AI agents, those who can process text and pictures (or other media), are rapidly entering real-world domains like autonomous driving, healthcare, and robotics. In these settings, we now have traditionally used...

Overcoming Nonsmoothness and Control Chattering in Nonconvex Optimal Control Problems

One might encounter various frustrating difficulties when attempting to numerically solve a difficult nonlinear and nonconvex optimal control problem. In this text I'll consider such a difficult problem, that of finding the shortest path...

Recent posts

Popular categories

ASK ANA