Predicting condensate formation by cancer-associated fusion oncoproteins

Predicting condensate formation by cancer-associated fusion oncoproteins

Many cancers are caused by fusion oncoproteins, molecules that aberrantly form when a rearrangement of DNA results in parts of two different proteins being expressed as one. Several fusion oncoproteins spontaneously form condensates inside cells that promote cancer development.

New research by St. Jude Children’s Research Hospital established a method to study this biophysical process in cells, then used that information as a launchpad to predict the behavior of other fusion oncoproteins. The findings, which offer insight into fusion oncoprotein-driven cancers, were published in Nature Communications.

While genes define everything about us, they are not immutable. Genes are made of DNA, which is constantly being read and replicated. Errors can occur, and sometimes a piece of DNA can break and reattach at a different location. This can lead to two previously independent genes being glued together, resulting in a fusion protein. These unnatural proteins retain properties of both original components, which can have disastrous consequences for cells.

“Fusion proteins have been shown to be oncogenic drivers in upwards of 15% of human cancers,” said Richard Kriwacki, Ph.D., St. Jude Department of Structural Biology. These fusion oncoproteins can interfere with cellular regulatory pathways involved in cell growth and differentiation, leading to uncontrolled cell division and cancer.

Secrets in the droplets

“We hypothesized that gaining the ability to form condensates could be linked with the oncogenic properties of fusion oncoproteins,” Kriwacki explained.

Biomolecular condensates can form through a process called liquid-liquid phase separation, in which biomolecules separate from the surrounding local environment and form their own compartment, akin to oil droplets in water. Condensates have been shown to be very powerful tools for a cell to regulate many different processes. However, when a fusion oncoprotein has the ability to form a condensate, it can wreak havoc in our cells.

Kriwacki, along with collaborators set out to uncover how interwoven fusion oncoproteins were with the process of phase separation.

The code of fusion oncoprotein condensate behavior

The researchers initially examined 166 fusion oncoproteins in cells to observe if they phase separate. Then they categorized them, which was no small feat, according to co-first author Hazheen Shirnekhi, Ph.D., St. Jude Department of Structural Biology.

“The condensates were all different sizes, different shapes, and located in different areas of the cell,” Shirnekhi said. “It was difficult for any computer program to recognize the condensates in an unbiased manner, so we had to do this manually. It took a lot of time.”

This effort revealed that 58% of the fusion oncoproteins examined formed condensates, opening the door to additional insights.

“We found that a large number of those fusion oncoproteins that form condensates, especially in the nucleus, had functional features associated with regulation of gene expression,” Kriwacki said.

“The cytoplasmic fusion oncoproteins forming condensates had functional features associated with regulation of cell signaling.” These observations suggest that the fusion oncoproteins elicit their oncogenic properties by altering gene regulation or cell signaling pathways through formation of condensates.

Machine learning reveals scope of phenomenon

In addition to those links to cellular functions, patterns began to emerge within the protein sequences of the fusion oncoproteins that form condensates. These patterns involve so-called physicochemical features, such as number of polar amino acids, charged groups or disordered regions.

“When we looked at the sequences of the condensate-forming fusion oncoproteins, we noticed features that are distinct from the condensate-negative fusion oncoproteins,” explained co-first author Swarnendu Tripathi, Ph.D., St. Jude Department of Structural Biology. “That motivated us to select 25 non-redundant features and use data science to predict whether a fusion oncoprotein forms condensates or not.”

This data science aspect allowed the researchers to use their 166-sample groundwork to train a machine-learning algorithm using those 25 features. The computational model was then applied to predict the condensate-forming behavior of ~3,000 additional fusion oncoproteins associated with different cancer types.

The model predicted that upwards of 67% of those additional fusion oncoproteins likely form condensates. The condensate-forming predictions were tested for a subset of fusion oncoproteins. “The model was shown to be 80% accurate in independent testing with fusions not used in the training,” Tripathi noted.

This research establishes the foundational framework for determining the mechanisms underlying the oncogenic properties of fusion oncoproteins to enable their targeted inhibition through pharmaceutical agents or alternative approaches. “We’re looking to address the relationship between condensate formation, alteration of gene expression and oncogenesis,” Kriwacki explained. “We’re working with collaborators so that we can address this causality question in as rigorous a way as possible.”

As Kriwacki highlighted, “By obtaining a grasp of the underlying mechanisms, we are setting the stage for potential innovative therapeutic approaches against fusion oncoprotein-driven cancers.”

More information:
Swarnendu Tripathi et al, Defining the condensate landscape of fusion oncoproteins, Nature Communications (2023). DOI: 10.1038/s41467-023-41655-2

Journal information:
Nature Communications

Source: Read Full Article