Lab Manager | Run Your Lab Like a Business
A round, clear petri dish with a clear, speckled film inside is in the foreground. Two men in safety glasses and lab coats are out of focus in the background.
Image credit: Sandra Swisher, Department of Chemical Engineering, University of Michigan

A Simple and Robust Experimental Process for Protein Engineering

New easily interpretable technique can reduce costs and increase scale of protein optimization for multiple applications

by University of Michigan
Register for free to listen to this article
Listen with Speechify

A protein engineering method using simple, cost-effective experiments and machine learning models can predict which proteins will be effective for a given purpose, according to a new study by University of Michigan researchers.

The method has far-reaching potential to assemble proteins and peptides for applications from industry tools to therapeutics. For instance, this technique can help speed up the development of stabilized peptides for treating diseases in ways that current medicines can't, including improving how exclusively antibodies bind to their targets in immunotherapy.

Get training in Chemical Hygiene and earn CEUs.One of over 25 IACET-accredited courses in the Academy.
Chemical Hygiene Course

"The rules that govern how proteins work, from sequence to structure to function, are so complicated. Contributing to the interpretability of protein engineering efforts is particularly exciting," said Marshall Case, a doctoral graduate of chemical engineering at U-M and first author of the study. 

Currently, most protein engineering experiments use complex, labor-intensive methods and expensive instruments to attain very precise data. The long process limits how much data can be acquired, and the complicated methods are challenging to learn and execute—a trade-off for precision.

"Our method has shown that for many applications, you can avoid these complicated methods," said Case, now a computational biologist at Manifold Biotechnologies.

The updated method starts by sorting cells into two groups, known as binary sorting, based on whether they express a desired trait—like binding to fluorescent molecules—or not. Then, the cells are sequenced to get the underlying DNA codes for the proteins of interest. Machine learning algorithms then reduce the noise in the sequencing data to identify the best possible protein. 

"Rather than selecting the 'best book' from the library, it's like reading many books, then piecing together different pages from different stories to come up with the best book possible, even if it wasn't in your original library," said Greg Thurber, U-M associate professor of chemical engineering and corresponding author on the paper. "I was surprised to see the robustness of this technique using simple, binary sorting data."

Further enhancing its accessibility, the method uses linear machine learning models, which are easier to interpret compared to models with dozens of parameters.

"Because we can learn physical rules about how the proteins are actually working, we can use linear equations to model nonlinear protein behavior and make better drugs that way," Case said.

- This press release was originally published on the University of Michigan website