Rosetta@home: Difference between revisions

From BOINC Projects
Jump to navigation Jump to search
Al Piskun (talk | contribs)
infobox
Al Piskun (talk | contribs)
update and infobox
Line 4: Line 4:
| logo caption        = Rosetta@home logo
| logo caption        = Rosetta@home logo
| screenshot          = Rosetta.gif
| screenshot          = Rosetta.gif
| caption              = Rosetta@home screensaver
| caption              = Rosetta@home screensaver showing protein folding simulations
 
| status              = Active
| status              = Active
| category            = Bioinformatics, Protein structure prediction, Distributed computing
| category            = Bioinformatics, Protein structure prediction, Distributed computing
Line 43: Line 42:
== Overview ==
== Overview ==


Proteins are biological macromolecules composed of amino acid chains that fold into highly complex three-dimensional structures. The function of a protein depends heavily on its final folded conformation. Predicting how proteins fold from their amino acid sequence remains one of the major challenges in computational biology and biochemistry.<ref>{{cite web|url=https://en.wikipedia.org/wiki/Protein_folding|title=Protein folding}}</ref>
Proteins are biological macromolecules composed of amino acid chains that fold into highly complex three-dimensional structures. The function of a protein depends heavily on its final folded conformation, and predicting how proteins fold from their amino acid sequence remains one of the central problems in computational biology and biochemistry.<ref>{{cite web|url=https://en.wikipedia.org/wiki/Protein_folding|title=Protein folding}}</ref>


Rosetta@home enables volunteers around the world to contribute computing power toward:
Rosetta@home allows volunteers around the world to contribute spare computing power toward scientific research involving protein structure prediction, protein docking, computational enzyme design, and the study of molecular interactions. The project has also been used in vaccine research, antiviral therapeutic development, cancer-related protein analysis, and studies involving neurodegenerative disorders such as [[wikipedia:Alzheimer's disease|Alzheimer's disease]], [[wikipedia:Parkinson's disease|Parkinson's disease]], and [[wikipedia:Huntington's disease|Huntington's disease]]. By distributing millions of calculations across volunteer computers, Rosetta@home enables scientific simulations that would otherwise require extremely large supercomputing facilities.
 
* Protein structure prediction
* Protein docking
* Protein interface analysis
* Computational enzyme design
* Vaccine and therapeutic research
* Antiviral protein development
* Cancer-related protein research
* Research into neurodegenerative diseases
 
Many diseases are associated with protein misfolding or protein interaction failures, including:
 
* [[wikipedia:Alzheimer's disease|Alzheimer's disease]]
* [[wikipedia:Parkinson's disease|Parkinson's disease]]
* [[wikipedia:Huntington's disease|Huntington's disease]]
* [[wikipedia:Cystic fibrosis|Cystic fibrosis]]
* Certain forms of cancer


== Scientific basis ==
== Scientific basis ==


Protein folding is governed by thermodynamics and molecular interactions. Rosetta software attempts to locate energetically favorable conformations by minimizing an approximate free-energy function.
Protein folding is governed by thermodynamics and molecular interactions. Rosetta software attempts to identify energetically favorable conformations by minimizing an approximate free-energy function while exploring large numbers of possible molecular arrangements.


The project commonly uses computational approaches including:
The Rosetta platform combines several computational approaches, including Monte Carlo sampling, energy minimization, fragment assembly, comparative modeling, ab initio structure prediction, and protein docking simulations. Modern Rosetta methods also incorporate statistical and machine-learning-assisted scoring functions to improve prediction accuracy.
 
* Monte Carlo sampling
* Energy minimization
* Fragment assembly
* Comparative modeling
* Ab initio structure prediction
* Protein docking simulations
*Machine-learning-assisted scoring functions


The Rosetta energy function attempts to minimize the free energy of candidate structures:
The Rosetta energy function attempts to minimize the free energy of candidate structures:
Line 82: Line 56:
<math>E_{total} = \sum_i w_iE_i</math>
<math>E_{total} = \sum_i w_iE_i</math>


where:
where <math>E_i</math> represents individual energy terms and <math>w_i</math> represents weighting coefficients applied to those terms.
 
* <math>E_i</math> represents individual energy terms
* <math>w_i</math> represents weighting coefficients


The project also uses stochastic Monte Carlo methods that accept or reject conformational changes according to probabilities derived from statistical thermodynamics:
The project also uses stochastic Monte Carlo methods that accept or reject conformational changes according to probabilities derived from statistical thermodynamics:
Line 91: Line 62:
<math>P = e^{-\Delta E / kT}</math>
<math>P = e^{-\Delta E / kT}</math>


where:
where <math>\Delta E</math> is the change in energy, <math>k</math> is the Boltzmann constant, and <math>T</math> is temperature.
 
* <math>\Delta E</math> is the change in energy
* <math>k</math> is the Boltzmann constant
* <math>T</math> is temperature


[[File:Protein_structure_examples.png|thumb|Examples of protein structures]]
[[File:Protein_structure_examples.png|thumb|Examples of protein structures]]
Line 101: Line 68:
== History ==
== History ==


The Rosetta software project originated during the late 1990s at the Baker Laboratory under the leadership of Professor [[wikipedia:David Baker (biochemist)|David Baker]]. Early Rosetta software focused on ab initio protein structure prediction and rapidly gained recognition within computational biology research communities.<ref>{{cite journal|last=Simons|first=K. T.|last2=Kooperberg|first2=C.|last3=Huang|first3=E.|last4=Baker|first4=D.|title=Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions|journal=Journal of Molecular Biology|year=1997|volume=268|issue=1|pages=209–225|doi=10.1006/jmbi.1997.0959}}</ref>
The Rosetta software project originated during the late 1990s at the Baker Laboratory under the leadership of Professor [[wikipedia:David Baker (biochemist)|David Baker]]. Early versions of Rosetta focused primarily on ab initio protein structure prediction and rapidly gained recognition within computational biology research communities.<ref>{{cite journal|last=Simons|first=K. T.|last2=Kooperberg|first2=C.|last3=Huang|first3=E.|last4=Baker|first4=D.|title=Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions|journal=Journal of Molecular Biology|year=1997|volume=268|issue=1|pages=209–225|doi=10.1006/jmbi.1997.0959}}</ref>
 
Rosetta@home became publicly available through BOINC in 2005 and quickly attracted a large international volunteer community. The project became one of the flagship scientific applications of the BOINC ecosystem during the late 2000s and early 2010s.<ref>{{cite web|url=https://web.archive.org/web/*/https://boinc.bakerlab.org/rosetta/|title=Archived Rosetta@home pages}}</ref>


Major growth periods occurred during:
Rosetta@home became publicly available through BOINC in 2005 and quickly attracted a large international volunteer community. During the late 2000s and early 2010s, the project became one of the flagship scientific applications within the BOINC ecosystem.<ref>{{cite web|url=https://web.archive.org/web/*/https://boinc.bakerlab.org/rosetta/|title=Archived Rosetta@home pages}}</ref>


* CASP protein structure prediction competitions
The project experienced substantial growth during major scientific initiatives involving influenza and HIV research, CASP protein structure prediction competitions, and the development of computational protein design methods. Public participation increased dramatically again during the COVID-19 pandemic as global attention focused on antiviral research and computational biology.
* Influenza and HIV research initiatives
* Development of computational protein design methods
* The COVID-19 pandemic


== CASP participation ==
== CASP participation ==


Rosetta methods achieved significant success in the [[wikipedia:Critical Assessment of protein Structure Prediction|CASP]] competitions, which evaluate computational protein structure prediction methods using experimentally determined structures not yet publicly released.
Rosetta methods achieved significant success in the [[wikipedia:Critical Assessment of protein Structure Prediction|CASP]] competitions, which evaluate computational protein structure prediction methods using experimentally determined structures that have not yet been publicly released.


Performance in CASP competitions helped establish Rosetta as one of the leading protein prediction frameworks in computational biology.<ref>{{cite journal|last=Moult|first=John|title=Critical assessment of methods of protein structure prediction (CASP): Round XIII|journal=Proteins|year=2019|volume=87|issue=12|pages=1011–1020|doi=10.1002/prot.25823}}</ref>
Performance in CASP competitions helped establish Rosetta as one of the leading protein prediction frameworks in computational biology.<ref>{{cite journal|last=Moult|first=John|title=Critical assessment of methods of protein structure prediction (CASP): Round XIII|journal=Proteins|year=2019|volume=87|issue=12|pages=1011–1020|doi=10.1002/prot.25823}}</ref>
Line 122: Line 84:
[[File:PDB 1p5t EBI.jpg|thumb|Protein docking simulation example]]
[[File:PDB 1p5t EBI.jpg|thumb|Protein docking simulation example]]


Rosetta@home distributes small computational tasks known as ''work units'' to volunteer computers through BOINC. Each work unit evaluates different possible conformations or molecular interactions involving proteins.
Rosetta@home distributes small computational tasks known as ''work units'' to volunteer computers through BOINC. Each work unit evaluates different possible conformations or molecular interactions involving proteins, with completed results returned to project servers for further scientific analysis.
 
The Rosetta software suite includes multiple scientific modules:
 
=== Ab initio structure prediction ===


Ab initio methods attempt to predict protein structures directly from amino acid sequence without relying entirely on experimentally solved templates.
The Rosetta software suite contains multiple specialized scientific modules for different forms of biomolecular modeling. Ab initio methods attempt to predict protein structures directly from amino acid sequences without relying entirely on experimentally solved templates. Protein docking simulations study how proteins interact with other proteins or molecules, while RosettaDesign allows researchers to computationally create entirely new proteins not found in nature.<ref>{{cite journal|last=Gray|first=Jeffrey J.|title=Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations|journal=Journal of Molecular Biology|year=2003|volume=331|issue=1|pages=281–299|doi=10.1016/S0022-2836(03)00670-3}}</ref><ref>{{cite journal|last=Kuhlman|first=Brian|title=Design of a novel globular protein fold with atomic-level accuracy|journal=Science|year=2003|volume=302|issue=5649|pages=1364–1368|doi=10.1126/science.1089427}}</ref>


=== Protein docking ===
Many Rosetta methods use libraries of experimentally observed protein fragments during conformational searches. This fragment-based approach significantly reduces the complexity of the protein-folding problem while improving the likelihood of identifying physically realistic structures.
 
Docking simulations attempt to determine how proteins interact with other proteins or molecules. RosettaDock is one major Rosetta subsystem dedicated to these calculations.<ref>{{cite journal|last=Gray|first=Jeffrey J.|title=Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations|journal=Journal of Molecular Biology|year=2003|volume=331|issue=1|pages=281–299|doi=10.1016/S0022-2836(03)00670-3}}</ref>
 
=== Protein design ===
 
RosettaDesign enables researchers to computationally create entirely new proteins not found in nature. These methods have been used to design enzymes, binders, and antiviral mini-proteins.<ref>{{cite journal|last=Kuhlman|first=Brian|title=Design of a novel globular protein fold with atomic-level accuracy|journal=Science|year=2003|volume=302|issue=5649|pages=1364–1368|doi=10.1126/science.1089427}}</ref>
 
=== Fragment assembly ===
 
Many Rosetta methods use libraries of known protein fragments to assemble candidate structures during conformational searches.


== COVID-19 research ==
== COVID-19 research ==
Line 146: Line 94:
[[File:Protein_folding.png|thumb|Illustration of protein folding pathways]]
[[File:Protein_folding.png|thumb|Illustration of protein folding pathways]]


Rosetta@home became heavily involved in COVID-19 research beginning in early 2020. Public awareness of the project increased dramatically during the pandemic as volunteers contributed substantial additional computing power to pandemic-related research.<ref>{{cite web|url=https://www.ipd.uw.edu/covid-19/|title=Institute for Protein Design COVID-19 research}}</ref>
Rosetta@home became heavily involved in COVID-19 research beginning in early 2020. Public awareness of the project increased dramatically during the pandemic as volunteers contributed substantial additional computing power toward urgent SARS-CoV-2 research efforts.<ref>{{cite web|url=https://www.ipd.uw.edu/covid-19/|title=Institute for Protein Design COVID-19 research}}</ref>
 
Researchers used Rosetta to:
 
* Design mini-proteins that bind the SARS-CoV-2 spike protein
* Study viral protein structures
* Develop candidate antiviral therapeutics
* Assist vaccine-related molecular research


One major achievement involved the creation of synthetic mini-proteins capable of strongly binding to the SARS-CoV-2 spike protein in laboratory studies.<ref>{{cite journal|last=Cao|first=Longxing|title=De novo design of picomolar SARS-CoV-2 miniprotein inhibitors|journal=Nature|year=2021|volume=595|issue=7867|pages=551–556|doi=10.1038/s41586-021-03819-2}}</ref>
Researchers used Rosetta software to study viral protein structures, investigate spike-protein interactions, and design synthetic mini-proteins capable of binding tightly to the SARS-CoV-2 spike protein. Some of these engineered proteins demonstrated strong neutralizing capabilities in laboratory studies and were investigated as potential antiviral therapeutics and diagnostic tools.<ref>{{cite journal|last=Cao|first=Longxing|title=De novo design of picomolar SARS-CoV-2 miniprotein inhibitors|journal=Nature|year=2021|volume=595|issue=7867|pages=551–556|doi=10.1038/s41586-021-03819-2}}</ref>


The project received significant international media coverage during this period, causing major increases in volunteer participation and BOINC activity.<ref>{{cite web|url=https://www.reddit.com/r/BOINC/|title=r/BOINC discussions}}</ref>
The project received substantial international media coverage during this period, resulting in large increases in volunteer participation and overall BOINC activity.<ref>{{cite web|url=https://www.reddit.com/r/BOINC/|title=r/BOINC discussions}}</ref>


== RosettaCommons ==
== RosettaCommons ==
Line 163: Line 104:
[[File:SARS-CoV-2_without_background.png|thumb|Illustration of SARS-CoV-2]]
[[File:SARS-CoV-2_without_background.png|thumb|Illustration of SARS-CoV-2]]


The broader Rosetta software ecosystem is maintained by [[wikipedia:RosettaCommons|RosettaCommons]], an international consortium of universities and research institutions collaborating on computational structural biology software development.<ref>{{cite web|url=https://www.rosettacommons.org/about|title=About RosettaCommons}}</ref>
The broader Rosetta software ecosystem is maintained by [[wikipedia:RosettaCommons|RosettaCommons]], an international consortium of universities, medical research institutes, and scientific organizations collaborating on computational structural biology software development.<ref>{{cite web|url=https://www.rosettacommons.org/about|title=About RosettaCommons}}</ref>


RosettaCommons includes contributors from:
RosettaCommons coordinates development of the Rosetta biomolecular modeling framework and supports scientific workshops, educational resources, and collaborative research initiatives. The consortium has played a major role in advancing computational protein design and structural bioinformatics, and Rosetta software is now widely used throughout the international molecular biology research community.
 
* Universities
* Medical research institutes
* National laboratories
* International research organizations
 
The collaboration supports development of:
 
* Rosetta biomolecular modeling software
* Educational resources
* Scientific workshops
* Protein design tools
* Molecular simulation frameworks


== Project team and sponsors ==
== Project team and sponsors ==
Line 184: Line 112:
[[File:University of Washington Red Square golden hour Seattle Washington.jpg|thumb|University of Washington campus]]
[[File:University of Washington Red Square golden hour Seattle Washington.jpg|thumb|University of Washington campus]]


Rosetta@home is operated primarily by the [https://www.bakerlab.org/ Baker Laboratory] at the [[wikipedia:University of Washington|University of Washington]] in Seattle, Washington, United States.
Rosetta@home is operated primarily by the [https://www.bakerlab.org/ Baker Laboratory] at the [[wikipedia:University of Washington|University of Washington]] in Seattle, Washington. The project was founded by Professor [[wikipedia:David Baker (biochemist)|David Baker]], whose research group became internationally recognized for advances in protein structure prediction and computational protein design.


Key researchers and contributors include:
In addition to the Baker Laboratory, Rosetta@home benefits from contributions by RosettaCommons scientists and researchers from numerous universities and scientific institutions around the world. The collaborative nature of the project has made Rosetta one of the largest and most influential computational biology frameworks developed through academic research partnerships.
 
* [[wikipedia:David Baker (biochemist)|David Baker]]
* RosettaCommons scientists
* Researchers from multiple international institutions
 
The project has collaborated with numerous scientific organizations and research groups worldwide.


== System requirements ==
== System requirements ==


Rosetta@home primarily supports:
Rosetta@home supports Microsoft Windows, Linux, and macOS operating systems and primarily performs CPU-based scientific calculations rather than GPU acceleration. Work units may run for several hours depending on processor performance and user-selected runtime settings, and some tasks can require moderate to high levels of system memory.


* Microsoft Windows
The BOINC platform allows volunteers to configure CPU utilization, network scheduling, temperature limits, disk usage quotas, and other operational settings. Rosetta@home applications also support checkpointing, allowing computations to resume after interruptions or system restarts.
* Linux
* macOS
 
The project mainly performs CPU-based scientific calculations rather than GPU acceleration.
 
Typical work units may:
 
* Run for several hours
* Use moderate to high amounts of system memory
* Require stable internet access for uploading results
* Generate checkpoint files for interrupted computations
 
BOINC allows volunteers to configure:
 
* CPU utilization limits
* Temperature and power management
* Network transfer schedules
* Runtime limits
* Disk usage quotas


== Community ==
== Community ==


[[File:Rosetta.gif|alt=Rosetta@home screensaver|thumb|Rosetta@home screensaver showing protein folding simulations]]
Rosetta@home has maintained a large and active international volunteer community since its launch in 2005. Volunteers commonly participate through BOINC teams, distributed computing forums, Reddit communities, and statistics aggregation websites such as BOINCstats and Free-DC.


Rosetta@home has maintained a large international volunteer community since its launch. Volunteers participate through:
The project has historically been one of the most visible and competitive projects within the BOINC ecosystem, with many volunteer teams contributing substantial computing resources during community competitions and distributed computing challenges. Historical BOINC forums and archived discussions show Rosetta@home frequently ranking among the largest volunteer computing projects of its era.
 
* BOINC teams
* Project message boards
* Reddit communities
* Distributed computing forums
* Statistics aggregation websites
 
Popular community resources include:
 
* BOINCstats
* Free-DC
* Team AnandTech
* Reddit BOINC communities
* Historical BOINC forums
 
The project regularly participates in distributed computing competitions and community events organized by BOINC teams.


== Scientific impact ==
== Scientific impact ==


Rosetta@home has contributed to numerous scientific advances involving:
Rosetta@home has contributed to major scientific advances in protein structure prediction, computational enzyme engineering, structural bioinformatics, antiviral therapeutic design, and synthetic protein development. Research performed using Rosetta methods has helped establish computational protein design as a major field within modern molecular biology.
 
* Protein structure prediction
* Protein engineering
* Computational enzymology
* Antiviral therapeutic development
* Vaccine research
* Structural bioinformatics
 
Notable achievements include:


* Strong performance in CASP competitions
The project achieved particular recognition through strong performances in CASP protein structure prediction competitions and through the development of novel synthetic proteins and antiviral binders. During the COVID-19 pandemic, Rosetta-related research became widely known for its work involving SARS-CoV-2 spike-protein inhibitors and de novo designed mini-proteins.
* Development of novel synthetic proteins
* Advances in computational enzyme design
* SARS-CoV-2 antiviral binder design
* Contributions to structural biology research


Scientific publications related to Rosetta@home and Rosetta software are archived through BOINC and RosettaCommons publication databases.<ref>{{cite web|url=https://boinc.berkeley.edu/pubs.php#Rosetta@home|title=BOINC scientific publications}}</ref>
Scientific publications related to Rosetta@home and the Rosetta software suite are archived through BOINC and RosettaCommons publication databases.<ref>{{cite web|url=https://boinc.berkeley.edu/pubs.php#Rosetta@home|title=BOINC scientific publications}}</ref>


[[File:Protein_structure.jpg|thumb|Rendered protein structure]]
[[File:Protein_structure.jpg|thumb|Rendered protein structure]]
Line 266: Line 140:
== Scientific publications ==
== Scientific publications ==


Rosetta-related research has produced hundreds of peer-reviewed scientific papers in journals including:
Rosetta-related research has produced hundreds of peer-reviewed scientific papers published in journals including ''Nature'', ''Science'', ''Proceedings of the National Academy of Sciences'', ''Journal of Molecular Biology'', and ''Proteins''.
 
* ''Nature''
* ''Science''
* ''Proceedings of the National Academy of Sciences''
* ''Journal of Molecular Biology''
* ''Proteins''


Selected publications include:
Selected publications include:
Line 282: Line 150:
* {{cite journal|last=Cao|first=Longxing|title=De novo design of picomolar SARS-CoV-2 miniprotein inhibitors|journal=Nature|year=2021|doi=10.1038/s41586-021-03819-2}}
* {{cite journal|last=Cao|first=Longxing|title=De novo design of picomolar SARS-CoV-2 miniprotein inhibitors|journal=Nature|year=2021|doi=10.1038/s41586-021-03819-2}}


Additional publication lists:
Additional publication lists are available through the BOINC publications archive and the RosettaCommons publications database.
 
* https://boinc.berkeley.edu/pubs.php#Rosetta@home
* https://www.rosettacommons.org/publications


== See also ==
== See also ==

Revision as of 01:50, 23 May 2026


Rosetta@home
Rosetta@home screensaver showing protein folding simulations
Project
StatusActive
CategoryBioinformatics, Protein structure prediction, Distributed computing
ComputeCPU
RequiresBOINC
Development
DeveloperBaker Laboratory
AuthorDavid Baker and collaborators
SponsorUniversity of Washington
MaintainerBaker Laboratory and RosettaCommons
Initial releaseJune 6, 2005  (21 years ago)
Repositoryhttps://github.com/RosettaCommons
Software
Written inC++, C
Operating systemWindows, Linux, macOS
SizeVaries by work unit
BOINC statistics
Stats as ofMay 22, 2026  (0 years ago)
PerformanceSeveral PFLOPS distributed across volunteer hosts
Active users25,000
Total users1,000,000
Active hosts45,000
Total hosts3,000,000
Analytics
CPU performanceLarge-scale distributed CPU processing
Metadata
Websitehttps://boinc.bakerlab.org/rosetta/
LicenseMixed proprietary and academic research licensing

[[File:{{#setmainimage:Rosettahome.png}}|alt=Rosetta@home logo|center|frameless]]

Rosetta@home is a volunteer distributed computing project that uses the BOINC platform to help researchers predict and design the three-dimensional structures of proteins. The project is operated by the Baker Laboratory at the University of Washington in Seattle, Washington, and is considered one of the most scientifically successful and widely recognized BOINC projects.[1][2]

Rosetta@home officially launched in 2005 as a public volunteer computing extension of the Rosetta protein modeling software suite. Volunteers donate spare CPU resources from personal computers to perform large-scale molecular simulations involving protein folding, protein docking, and protein design.[3]

Overview

Proteins are biological macromolecules composed of amino acid chains that fold into highly complex three-dimensional structures. The function of a protein depends heavily on its final folded conformation, and predicting how proteins fold from their amino acid sequence remains one of the central problems in computational biology and biochemistry.[4]

Rosetta@home allows volunteers around the world to contribute spare computing power toward scientific research involving protein structure prediction, protein docking, computational enzyme design, and the study of molecular interactions. The project has also been used in vaccine research, antiviral therapeutic development, cancer-related protein analysis, and studies involving neurodegenerative disorders such as Alzheimer's disease, Parkinson's disease, and Huntington's disease. By distributing millions of calculations across volunteer computers, Rosetta@home enables scientific simulations that would otherwise require extremely large supercomputing facilities.

Scientific basis

Protein folding is governed by thermodynamics and molecular interactions. Rosetta software attempts to identify energetically favorable conformations by minimizing an approximate free-energy function while exploring large numbers of possible molecular arrangements.

The Rosetta platform combines several computational approaches, including Monte Carlo sampling, energy minimization, fragment assembly, comparative modeling, ab initio structure prediction, and protein docking simulations. Modern Rosetta methods also incorporate statistical and machine-learning-assisted scoring functions to improve prediction accuracy.

The Rosetta energy function attempts to minimize the free energy of candidate structures:

<math>E_{total} = \sum_i w_iE_i</math>

where <math>E_i</math> represents individual energy terms and <math>w_i</math> represents weighting coefficients applied to those terms.

The project also uses stochastic Monte Carlo methods that accept or reject conformational changes according to probabilities derived from statistical thermodynamics:

<math>P = e^{-\Delta E / kT}</math>

where <math>\Delta E</math> is the change in energy, <math>k</math> is the Boltzmann constant, and <math>T</math> is temperature.

Examples of protein structures

History

The Rosetta software project originated during the late 1990s at the Baker Laboratory under the leadership of Professor David Baker. Early versions of Rosetta focused primarily on ab initio protein structure prediction and rapidly gained recognition within computational biology research communities.[5]

Rosetta@home became publicly available through BOINC in 2005 and quickly attracted a large international volunteer community. During the late 2000s and early 2010s, the project became one of the flagship scientific applications within the BOINC ecosystem.[6]

The project experienced substantial growth during major scientific initiatives involving influenza and HIV research, CASP protein structure prediction competitions, and the development of computational protein design methods. Public participation increased dramatically again during the COVID-19 pandemic as global attention focused on antiviral research and computational biology.

CASP participation

Rosetta methods achieved significant success in the CASP competitions, which evaluate computational protein structure prediction methods using experimentally determined structures that have not yet been publicly released.

Performance in CASP competitions helped establish Rosetta as one of the leading protein prediction frameworks in computational biology.[7]

Methods

Protein docking simulation example

Rosetta@home distributes small computational tasks known as work units to volunteer computers through BOINC. Each work unit evaluates different possible conformations or molecular interactions involving proteins, with completed results returned to project servers for further scientific analysis.

The Rosetta software suite contains multiple specialized scientific modules for different forms of biomolecular modeling. Ab initio methods attempt to predict protein structures directly from amino acid sequences without relying entirely on experimentally solved templates. Protein docking simulations study how proteins interact with other proteins or molecules, while RosettaDesign allows researchers to computationally create entirely new proteins not found in nature.[8][9]

Many Rosetta methods use libraries of experimentally observed protein fragments during conformational searches. This fragment-based approach significantly reduces the complexity of the protein-folding problem while improving the likelihood of identifying physically realistic structures.

COVID-19 research

Illustration of protein folding pathways

Rosetta@home became heavily involved in COVID-19 research beginning in early 2020. Public awareness of the project increased dramatically during the pandemic as volunteers contributed substantial additional computing power toward urgent SARS-CoV-2 research efforts.[10]

Researchers used Rosetta software to study viral protein structures, investigate spike-protein interactions, and design synthetic mini-proteins capable of binding tightly to the SARS-CoV-2 spike protein. Some of these engineered proteins demonstrated strong neutralizing capabilities in laboratory studies and were investigated as potential antiviral therapeutics and diagnostic tools.[11]

The project received substantial international media coverage during this period, resulting in large increases in volunteer participation and overall BOINC activity.[12]

RosettaCommons

Illustration of SARS-CoV-2

The broader Rosetta software ecosystem is maintained by RosettaCommons, an international consortium of universities, medical research institutes, and scientific organizations collaborating on computational structural biology software development.[13]

RosettaCommons coordinates development of the Rosetta biomolecular modeling framework and supports scientific workshops, educational resources, and collaborative research initiatives. The consortium has played a major role in advancing computational protein design and structural bioinformatics, and Rosetta software is now widely used throughout the international molecular biology research community.

Project team and sponsors

University of Washington campus

Rosetta@home is operated primarily by the Baker Laboratory at the University of Washington in Seattle, Washington. The project was founded by Professor David Baker, whose research group became internationally recognized for advances in protein structure prediction and computational protein design.

In addition to the Baker Laboratory, Rosetta@home benefits from contributions by RosettaCommons scientists and researchers from numerous universities and scientific institutions around the world. The collaborative nature of the project has made Rosetta one of the largest and most influential computational biology frameworks developed through academic research partnerships.

System requirements

Rosetta@home supports Microsoft Windows, Linux, and macOS operating systems and primarily performs CPU-based scientific calculations rather than GPU acceleration. Work units may run for several hours depending on processor performance and user-selected runtime settings, and some tasks can require moderate to high levels of system memory.

The BOINC platform allows volunteers to configure CPU utilization, network scheduling, temperature limits, disk usage quotas, and other operational settings. Rosetta@home applications also support checkpointing, allowing computations to resume after interruptions or system restarts.

Community

Rosetta@home has maintained a large and active international volunteer community since its launch in 2005. Volunteers commonly participate through BOINC teams, distributed computing forums, Reddit communities, and statistics aggregation websites such as BOINCstats and Free-DC.

The project has historically been one of the most visible and competitive projects within the BOINC ecosystem, with many volunteer teams contributing substantial computing resources during community competitions and distributed computing challenges. Historical BOINC forums and archived discussions show Rosetta@home frequently ranking among the largest volunteer computing projects of its era.

Scientific impact

Rosetta@home has contributed to major scientific advances in protein structure prediction, computational enzyme engineering, structural bioinformatics, antiviral therapeutic design, and synthetic protein development. Research performed using Rosetta methods has helped establish computational protein design as a major field within modern molecular biology.

The project achieved particular recognition through strong performances in CASP protein structure prediction competitions and through the development of novel synthetic proteins and antiviral binders. During the COVID-19 pandemic, Rosetta-related research became widely known for its work involving SARS-CoV-2 spike-protein inhibitors and de novo designed mini-proteins.

Scientific publications related to Rosetta@home and the Rosetta software suite are archived through BOINC and RosettaCommons publication databases.[14]

Rendered protein structure

Scientific publications

Rosetta-related research has produced hundreds of peer-reviewed scientific papers published in journals including Nature, Science, Proceedings of the National Academy of Sciences, Journal of Molecular Biology, and Proteins.

Selected publications include:

  • Simons, K. T..(1997}).Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. Journal of Molecular Biology. DOI: 10.1006/jmbi.1997.0959.
  • Kuhlman, Brian.(2003}).Design of a novel globular protein fold with atomic-level accuracy. Science. DOI: 10.1126/science.1089427.
  • Gray, Jeffrey J..(2003}).Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. Journal of Molecular Biology. DOI: 10.1016/S0022-2836(03)00670-3.
  • Das, Rhiju.(2008}).Macromolecular Modeling with Rosetta. Annual Review of Biochemistry. DOI: 10.1146/annurev.biochem.77.062906.171838.
  • Cao, Longxing.(2021}).De novo design of picomolar SARS-CoV-2 miniprotein inhibitors. Nature. DOI: 10.1038/s41586-021-03819-2.

Additional publication lists are available through the BOINC publications archive and the RosettaCommons publications database.

See also

External links

BOINC logo
BOINC logo

References

  1. Rosetta@home.
  2. Rosetta@home.
  3. Das, Rhiju.(2008}).Macromolecular Modeling with Rosetta. Annual Review of Biochemistry. pp. 363–382. DOI: 10.1146/annurev.biochem.77.062906.171838.
  4. Protein folding.
  5. Simons, K. T..(1997}).Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. Journal of Molecular Biology. pp. 209–225. DOI: 10.1006/jmbi.1997.0959.
  6. Archived Rosetta@home pages.
  7. Moult, John.(2019}).Critical assessment of methods of protein structure prediction (CASP): Round XIII. Proteins. pp. 1011–1020. DOI: 10.1002/prot.25823.
  8. Gray, Jeffrey J..(2003}).Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. Journal of Molecular Biology. pp. 281–299. DOI: 10.1016/S0022-2836(03)00670-3.
  9. Kuhlman, Brian.(2003}).Design of a novel globular protein fold with atomic-level accuracy. Science. pp. 1364–1368. DOI: 10.1126/science.1089427.
  10. Institute for Protein Design COVID-19 research.
  11. Cao, Longxing.(2021}).De novo design of picomolar SARS-CoV-2 miniprotein inhibitors. Nature. pp. 551–556. DOI: 10.1038/s41586-021-03819-2.
  12. r/BOINC discussions.
  13. About RosettaCommons.
  14. BOINC scientific publications.