All biological function relies on how different proteins interact with one another. Protein-protein interactions facilitate every little thing from transcribing DNA and controlling cell division to higher-level functions in complex organisms.
Much stays unclear, nevertheless, about how these functions are orchestrated on the molecular level, and the way proteins interact with one another — either with other proteins or with copies of themselves.
Recent findings have revealed that small protein fragments have a whole lot of functional potential. Despite the fact that they’re incomplete pieces, short stretches of amino acids can still bind to interfaces of a goal protein, recapitulating native interactions. Through this process, they will alter that protein’s function or disrupt its interactions with other proteins.
Protein fragments could subsequently empower each basic research on protein interactions and cellular processes, and will potentially have therapeutic applications.
Recently published in , a brand new method developed within the Department of Biology builds on existing artificial intelligence models to computationally predict protein fragments that may bind to and inhibit full-length proteins in . Theoretically, this tool may lead to genetically encodable inhibitors against any protein.
The work was done within the lab of associate professor of biology and Howard Hughes Medical Institute investigator Gene-Wei Li in collaboration with the lab of Jay A. Stein (1968) Professor of Biology, professor of biological engineering, and department head Amy Keating.
Leveraging machine learning
This system, called FragFold, leverages AlphaFold, an AI model that has led to phenomenal advancements in biology in recent times attributable to its ability to predict protein folding and protein interactions.
The goal of the project was to predict fragment inhibitors, which is a novel application of AlphaFold. The researchers on this project confirmed experimentally that greater than half of FragFold’s predictions for binding or inhibition were accurate, even when researchers had no previous structural data on the mechanisms of those interactions.
“Our results suggest that this can be a generalizable approach to seek out binding modes which can be prone to inhibit protein function, including for novel protein targets, and you need to use these predictions as a place to begin for further experiments,” says co-first and corresponding creator Andrew Savinov, a postdoc within the Li Lab. “We will really apply this to proteins without known functions, without known interactions, without even known structures, and we are able to put some credence in these models we’re developing.”
One example is FtsZ, a protein that is essential for cell division. It’s well-studied but accommodates a region that’s intrinsically disordered and, subsequently, especially difficult to review. Disordered proteins are dynamic, and their functional interactions are very likely fleeting — occurring so briefly that current structural biology tools can’t capture a single structure or interaction.
The researchers leveraged FragFold to explore the activity of fragments of FtsZ, including fragments of the intrinsically disordered region, to discover several recent binding interactions with various proteins. This leap in understanding confirms and expands upon previous experiments measuring FtsZ’s biological activity.
This progress is important partly since it was made without solving the disordered region’s structure, and since it exhibits the potential power of FragFold.
“That is one example of how AlphaFold is fundamentally changing how we are able to study molecular and cell biology,” Keating says. “Creative applications of AI methods, reminiscent of our work on FragFold, open up unexpected capabilities and recent research directions.”
Inhibition, and beyond
The researchers completed these predictions by computationally fragmenting each protein after which modeling how those fragments would bind to interaction partners they thought were relevant.
They compared the maps of predicted binding across the complete sequence to the results of those self same fragments in living cells, determined using high-throughput experimental measurements wherein hundreds of thousands of cells each produce one variety of protein fragment.
AlphaFold uses co-evolutionary information to predict folding, and typically evaluates the evolutionary history of proteins using something called multiple sequence alignments for each single prediction run. The MSAs are critical, but are a bottleneck for large-scale predictions — they will take a prohibitive period of time and computational power.
For FragFold, the researchers as an alternative pre-calculated the MSA for a full-length protein once, and used that result to guide the predictions for every fragment of that full-length protein.
Savinov, along with Keating Lab alumnus Sebastian Swanson PhD ’23, predicted inhibitory fragments of a various set of proteins along with FtsZ. Among the many interactions they explored was a posh between lipopolysaccharide transport proteins LptF and LptG. A protein fragment of LptG inhibited this interaction, presumably disrupting the delivery of lipopolysaccharide, which is a vital component of the outer cell membrane essential for cellular fitness.
“The massive surprise was that we are able to predict binding with such high accuracy and, the truth is, often predict binding that corresponds to inhibition,” Savinov says. “For each protein we’ve checked out, we’ve been in a position to find inhibitors.”
The researchers initially focused on protein fragments as inhibitors because whether a fraction could block a vital function in cells is a comparatively easy consequence to measure systematically. Looking forward, Savinov can also be fascinated about exploring fragment function outside inhibition, reminiscent of fragments that may stabilize the protein they bind to, enhance or alter its function, or trigger protein degradation.
Design, in principle
This research is a place to begin for developing a systemic understanding of cellular design principles, and what elements deep-learning models could also be drawing on to make accurate predictions.
“There’s a broader, further-reaching goal that we’re constructing towards,” Savinov says. “Now that we are able to predict them, can we use the information we’ve got from predictions and experiments to tug out the salient features to determine what AlphaFold has actually learned about what makes a great inhibitor?”
Savinov and collaborators also delved further into how protein fragments bind, exploring other protein interactions and mutating specific residues to see how those interactions change how the fragment interacts with its goal.
Experimentally examining the behavior of 1000’s of mutated fragments inside cells, an approach generally known as deep mutational scanning, revealed key amino acids which can be chargeable for inhibition. In some cases, the mutated fragments were even stronger inhibitors than their natural, full-length sequences.
“Unlike previous methods, we usually are not limited to identifying fragments in experimental structural data,” says Swanson. “The core strength of this work is the interplay between high-throughput experimental inhibition data and the anticipated structural models: the experimental data guides us towards the fragments which can be particularly interesting, while the structural models predicted by FragFold provide a particular, testable hypothesis for a way the fragments function on a molecular level.”
Savinov is worked up in regards to the way forward for this approach and its myriad applications.
“By creating compact, genetically encodable binders, FragFold opens a big selection of possibilities to govern protein function,” Li agrees. “We will imagine delivering functionalized fragments that may modify native proteins, change their subcellular localization, and even reprogram them to create recent tools for studying cell biology and treating diseases.”