Automating the maths for decision-making under uncertainty


One reason deep learning exploded over the past decade was the supply of programming languages that might automate the maths — college-level calculus — that is required to coach each latest model. Neural networks are trained by tuning their parameters to try to maximise a rating that may be rapidly calculated for training data. The equations used to regulate the parameters in each tuning step was derived painstakingly by hand. Deep learning platforms use a way called automatic differentiation to calculate the adjustments routinely. This allowed researchers to rapidly explore an enormous space of models, and find those that actually worked, without having to know the underlying math.

But what about problems like climate modeling, or financial planning, where the underlying scenarios are fundamentally uncertain? For these problems, calculus alone isn’t enough — you furthermore mght need probability theory. The “rating” isn’t any longer only a deterministic function of the parameters. As an alternative, it’s defined by a stochastic model that makes random selections to model unknowns. Should you try to make use of deep learning platforms on these problems, they’ll easily give the unsuitable answer. To repair this problem, MIT researchers developed ADEV, which extends automatic differentiation to handle models that make random selections. This brings the advantages of AI programming to a wider class of problems, enabling rapid experimentation with models that may reason about uncertain situations.

Lead creator and MIT electrical engineering and computer science PhD student Alex Lew says he hopes people shall be less wary of using probabilistic models now that there’s a tool to routinely differentiate them. “The necessity to derive low-variance, unbiased gradient estimators by hand can result in a perception that probabilistic models are trickier or more finicky to work with than deterministic ones. But probability is an incredibly useful gizmo for modeling the world. My hope is that by providing a framework for constructing these estimators routinely, ADEV will make it more attractive to experiment with probabilistic models, possibly enabling latest discoveries and advances in AI and beyond.”

Sasa Misailovic, an associate professor on the University of Illinois at Urbana-Champaign who was not involved on this research, adds: “Because the probabilistic programming paradigm is emerging to resolve various problems in science and engineering, questions arise on how we will make efficient software implementations built on solid mathematical principles. ADEV presents such a foundation for modular and compositional probabilistic inference with derivatives. ADEV brings the advantages of probabilistic programming — automated math and more scalable inference algorithms — to a wider range of problems where the goal isn’t simply to infer what might be true but to choose what motion to take next.”

Along with climate modeling and financial modeling, ADEV may be used for operations research — for instance, simulating customer queues for call centers to attenuate expected wait times, by simulating the wait processes and evaluating the standard of outcomes — or for tuning the algorithm that a robot uses to understand physical objects. Co-author Mathieu Huot says he’s excited to see ADEV “used as a design space for novel low-variance estimators, a key challenge in probabilistic computations.”

The research, awarded the SIGPLAN Distinguished Paper award at POPL 2023, is co-authored by Vikash Mansighka, who leads MIT’s Probabilistic Computing Project within the Department of Brain and Cognitive Sciences and the Computer Science and Artificial Intelligence Laboratory, and helps lead the MIT Quest for Intelligence, in addition to Mathieu Huot and Sam Staton, each at Oxford University. Huot adds, “ADEV gives a unified framework for reasoning about the ever present problem of estimating gradients unbiasedly, in a clean, elegant and compositional way.” The research was supported by the National Science Foundation, the DARPA Machine Common Sense program, and a philanthropic gift from the Siegel Family Foundation.

“Lots of our most controversial decisions — from climate policy to the tax code — boil right down to decision-making under uncertainty. ADEV makes it easier to experiment with latest ways to resolve these problems, by automating a few of the hardest math,” says Mansinghka. “For any problem that we will model using a probabilistic program, we’ve got latest, automated ways to tune the parameters to attempt to create outcomes that we would like, and avoid outcomes that we do not.”


What are your thoughts on this topic?
Let us know in the comments below.


0 0 votes
Article Rating
Newest Most Voted
Inline Feedbacks
View all comments

Share this article

Recent posts

Would love your thoughts, please comment.x