Rosetta@home

From BOINC Projects
Revision as of 13:37, 29 May 2026 by Al Piskun (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search


Rosetta@home
Rosetta@home screensaver showing protein folding simulations
Project
StatusActive
CategoryBioinformatics, Protein structure prediction, Distributed computing
ComputeCPU
RequiresBOINC
Development
DeveloperBaker Laboratory
AuthorDavid Baker and collaborators
SponsorUniversity of Washington
MaintainerBaker Laboratory and RosettaCommons
Initial releaseJune 6, 2005  (21 years ago)
Repositoryhttps://github.com/RosettaCommons
Software
Written inC++, C
Operating systemWindows, Linux, macOS
SizeVaries by work unit
BOINC statistics
Stats as ofMay 22, 2026  (0 years ago)
PerformanceSeveral PFLOPS distributed across volunteer hosts
Active users25,000
Total users1,000,000
Active hosts45,000
Total hosts3,000,000
Analytics
CPU performanceLarge-scale distributed CPU processing
Metadata
Websitehttps://boinc.bakerlab.org/rosetta/
LicenseMixed proprietary and academic research licensing

Rosetta@home is a volunteer distributed computing project that uses the BOINC platform to help researchers predict and design the three-dimensional structures of proteins. The project is operated by the Baker Laboratory at the University of Washington in Seattle, Washington, and is considered one of the most scientifically successful and widely recognized BOINC projects.[1][2]

Rosetta@home officially launched in 2005 as a public volunteer computing extension of the Rosetta protein modeling software suite. Volunteers donate spare CPU resources from personal computers to perform large-scale molecular simulations involving protein folding, protein docking, and protein design.[3]

Overview

Proteins are biological macromolecules composed of amino acid chains that fold into highly complex three-dimensional structures. The function of a protein depends heavily on its final folded conformation, and predicting how proteins fold from their amino acid sequence remains one of the central problems in computational biology and biochemistry.[4]

Rosetta@home allows volunteers around the world to contribute spare computing power toward scientific research involving protein structure prediction, protein docking, computational enzyme design, and the study of molecular interactions. The project has also been used in vaccine research, antiviral therapeutic development, cancer-related protein analysis, and studies involving neurodegenerative disorders such as Alzheimer's disease, Parkinson's disease, and Huntington's disease. By distributing millions of calculations across volunteer computers, Rosetta@home enables scientific simulations that would otherwise require extremely large supercomputing facilities.

Scientific basis

Protein folding is governed by thermodynamics and molecular interactions. Rosetta software attempts to identify energetically favorable conformations by minimizing an approximate free-energy function while exploring large numbers of possible molecular arrangements.

The Rosetta platform combines several computational approaches, including Monte Carlo sampling, energy minimization, fragment assembly, comparative modeling, ab initio structure prediction, and protein docking simulations. Modern Rosetta methods also incorporate statistical and machine-learning-assisted scoring functions to improve prediction accuracy.

The Rosetta energy function attempts to minimize the free energy of candidate structures:

<math>E_{total} = \sum_i w_iE_i</math>

where <math>E_i</math> represents individual energy terms and <math>w_i</math> represents weighting coefficients applied to those terms.

The project also uses stochastic Monte Carlo methods that accept or reject conformational changes according to probabilities derived from statistical thermodynamics:

<math>P = e^{-\Delta E / kT}</math>

where <math>\Delta E</math> is the change in energy, <math>k</math> is the Boltzmann constant, and <math>T</math> is temperature.

Examples of protein structures

History

The Rosetta software project originated during the late 1990s at the Baker Laboratory under the leadership of Professor David Baker. Early versions of Rosetta focused primarily on ab initio protein structure prediction and rapidly gained recognition within computational biology research communities.[5]

Rosetta@home became publicly available through BOINC in 2005 and quickly attracted a large international volunteer community. During the late 2000s and early 2010s, the project became one of the flagship scientific applications within the BOINC ecosystem.[6]

The project experienced substantial growth during major scientific initiatives involving influenza and HIV research, CASP protein structure prediction competitions, and the development of computational protein design methods. Public participation increased dramatically again during the COVID-19 pandemic as global attention focused on antiviral research and computational biology.

CASP participation

Rosetta methods achieved significant success in the CASP competitions, which evaluate computational protein structure prediction methods using experimentally determined structures that have not yet been publicly released.

Performance in CASP competitions helped establish Rosetta as one of the leading protein prediction frameworks in computational biology.[7]

Methods

Protein docking simulation example

Rosetta@home distributes small computational tasks known as work units to volunteer computers through BOINC. Each work unit evaluates different possible conformations or molecular interactions involving proteins, with completed results returned to project servers for further scientific analysis.

The Rosetta software suite contains multiple specialized scientific modules for different forms of biomolecular modeling. Ab initio methods attempt to predict protein structures directly from amino acid sequences without relying entirely on experimentally solved templates. Protein docking simulations study how proteins interact with other proteins or molecules, while RosettaDesign allows researchers to computationally create entirely new proteins not found in nature.[8][9]

Many Rosetta methods use libraries of experimentally observed protein fragments during conformational searches. This fragment-based approach significantly reduces the complexity of the protein-folding problem while improving the likelihood of identifying physically realistic structures.

COVID-19 research

Illustration of protein folding pathways

Rosetta@home became heavily involved in COVID-19 research beginning in early 2020. Public awareness of the project increased dramatically during the pandemic as volunteers contributed substantial additional computing power toward urgent SARS-CoV-2 research efforts.[10]

Researchers used Rosetta software to study viral protein structures, investigate spike-protein interactions, and design synthetic mini-proteins capable of binding tightly to the SARS-CoV-2 spike protein. Some of these engineered proteins demonstrated strong neutralizing capabilities in laboratory studies and were investigated as potential antiviral therapeutics and diagnostic tools.[11]

The project received substantial international media coverage during this period, resulting in large increases in volunteer participation and overall BOINC activity.[12]

RosettaCommons

Illustration of SARS-CoV-2

The broader Rosetta software ecosystem is maintained by RosettaCommons, an international consortium of universities, medical research institutes, and scientific organizations collaborating on computational structural biology software development.[13]

RosettaCommons coordinates development of the Rosetta biomolecular modeling framework and supports scientific workshops, educational resources, and collaborative research initiatives. The consortium has played a major role in advancing computational protein design and structural bioinformatics, and Rosetta software is now widely used throughout the international molecular biology research community.

Project team and sponsors

University of Washington campus

Rosetta@home is operated primarily by the Baker Laboratory at the University of Washington in Seattle, Washington. The project was founded by Professor David Baker, whose research group became internationally recognized for advances in protein structure prediction and computational protein design.

In addition to the Baker Laboratory, Rosetta@home benefits from contributions by RosettaCommons scientists and researchers from numerous universities and scientific institutions around the world. The collaborative nature of the project has made Rosetta one of the largest and most influential computational biology frameworks developed through academic research partnerships.

System requirements

Rosetta@home supports Microsoft Windows, Linux, and macOS operating systems and primarily performs CPU-based scientific calculations rather than GPU acceleration. Work units may run for several hours depending on processor performance and user-selected runtime settings, and some tasks can require moderate to high levels of system memory.

The BOINC platform allows volunteers to configure CPU utilization, network scheduling, temperature limits, disk usage quotas, and other operational settings. Rosetta@home applications also support checkpointing, allowing computations to resume after interruptions or system restarts.

Community

Rosetta@home has maintained a large and active international volunteer community since its launch in 2005. Volunteers commonly participate through BOINC teams, distributed computing forums, Reddit communities, and statistics aggregation websites such as BOINCstats and Free-DC.

The project has historically been one of the most visible and competitive projects within the BOINC ecosystem, with many volunteer teams contributing substantial computing resources during community competitions and distributed computing challenges. Historical BOINC forums and archived discussions show Rosetta@home frequently ranking among the largest volunteer computing projects of its era.

Scientific impact

Rosetta@home has contributed to major scientific advances in protein structure prediction, computational enzyme engineering, structural bioinformatics, antiviral therapeutic design, and synthetic protein development. Research performed using Rosetta methods has helped establish computational protein design as a major field within modern molecular biology.

The project achieved particular recognition through strong performances in CASP protein structure prediction competitions and through the development of novel synthetic proteins and antiviral binders. During the COVID-19 pandemic, Rosetta-related research became widely known for its work involving SARS-CoV-2 spike-protein inhibitors and de novo designed mini-proteins.

Scientific publications related to Rosetta@home and the Rosetta software suite are archived through BOINC and RosettaCommons publication databases.[14]

Rendered protein structure

Scientific publications

Rosetta-related research has produced hundreds of peer-reviewed scientific papers published in journals including Nature, Science, Proceedings of the National Academy of Sciences, Journal of Molecular Biology, and Proteins.

Selected publications include:

  • Simons, K. T..(1997}).Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. Journal of Molecular Biology. DOI: 10.1006/jmbi.1997.0959.
  • Kuhlman, Brian.(2003}).Design of a novel globular protein fold with atomic-level accuracy. Science. DOI: 10.1126/science.1089427.
  • Gray, Jeffrey J..(2003}).Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. Journal of Molecular Biology. DOI: 10.1016/S0022-2836(03)00670-3.
  • Das, Rhiju.(2008}).Macromolecular Modeling with Rosetta. Annual Review of Biochemistry. DOI: 10.1146/annurev.biochem.77.062906.171838.
  • Cao, Longxing.(2021}).De novo design of picomolar SARS-CoV-2 miniprotein inhibitors. Nature. DOI: 10.1038/s41586-021-03819-2.

Additional publication lists are available through the BOINC publications archive and the RosettaCommons publications database.

See also

External links

BOINC logo
BOINC logo

References

  1. Rosetta@home.
  2. Rosetta@home.
  3. Das, Rhiju.(2008}).Macromolecular Modeling with Rosetta. Annual Review of Biochemistry. pp. 363–382. DOI: 10.1146/annurev.biochem.77.062906.171838.
  4. Protein folding.
  5. Simons, K. T..(1997}).Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. Journal of Molecular Biology. pp. 209–225. DOI: 10.1006/jmbi.1997.0959.
  6. Archived Rosetta@home pages.
  7. Moult, John.(2019}).Critical assessment of methods of protein structure prediction (CASP): Round XIII. Proteins. pp. 1011–1020. DOI: 10.1002/prot.25823.
  8. Gray, Jeffrey J..(2003}).Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. Journal of Molecular Biology. pp. 281–299. DOI: 10.1016/S0022-2836(03)00670-3.
  9. Kuhlman, Brian.(2003}).Design of a novel globular protein fold with atomic-level accuracy. Science. pp. 1364–1368. DOI: 10.1126/science.1089427.
  10. Institute for Protein Design COVID-19 research.
  11. Cao, Longxing.(2021}).De novo design of picomolar SARS-CoV-2 miniprotein inhibitors. Nature. pp. 551–556. DOI: 10.1038/s41586-021-03819-2.
  12. r/BOINC discussions.
  13. About RosettaCommons.
  14. BOINC scientific publications.