AlphaFold DB provides open access to over 200 million protein structure predictions to accelerate scientific research.

Background

AlphaFold is an AI system developed by Google DeepMind that predicts a protein’s 3D structure from its amino acid sequence. It regularly achieves accuracy competitive with experiment.

Google DeepMind and EMBL’s European Bioinformatics Institute (EMBL-EBI) have partnered to create AlphaFold DB to make these predictions freely available to the scientific community. The latest database release contains over 200 million entries, providing broad coverage of UniProt (the standard repository of protein sequences and annotations). We provide individual downloads for the human proteome and for the proteomes of 47 other key organisms important in research and global health. We also provide a download for the manually curated subset of UniProt (Swiss-Prot).

image1

Q8I3H7: May protect the malaria parasite against attack by the immune system. Mean pLDDT 85.57.

View protein

In CASP14, AlphaFold was the top-ranked protein structure prediction method by a large margin, producing predictions with high accuracy. While the system still has some limitations, the CASP results suggest AlphaFold has immediate potential to help us understand the structure of proteins and advance biological research.

Let us know how the AlphaFold Protein Structure Database has been useful in your research, or if you have questions not answered in the FAQs, at alphafold@deepmind.com.

If your use case isn't covered by the database, you can generate your own AlphaFold predictions using this open source code, which also supports multimer prediction.

image2

Q8W3K0: A potential plant disease resistance protein. Mean pLDDT 82.24.

View protein

What’s new?

Updated and Integrated data - October 2025

We have made significant updates to the AlphaFold Database! We've fully synchronised our database with UniProt release 2025_03, ensuring you have access to the latest protein sequences.

In addition, you can now download the multiple sequence alignments (MSAs) used to generate each prediction. This offers a rich new layer of data for your evolutionary and structural analyses.

Read full article on EMBL-EBI site

news image

AF-A3EWL3-2-F1: DNA repair protein REV1

View protein

What’s next?

We plan to continue updating the database with structures for newly discovered protein sequences, and to improve features and functionality in response to user feedback. Please follow Google DeepMind's and EMBL-EBI’s social channels for updates.

Licence and attributions

Data is available for academic and commercial use, under a CC-BY-4.0 licence.

EMBL-EBI expects attribution (e.g., in publications, services, or products) for any of its online services, databases, or software in accordance with good scientific practice.

If you use this resource, please cite the following papers:

Fleming J. et al. AlphaFold Protein Structure Database and 3D-Beacons: New Data and Capabilities. Journal of Molecular Biology, (2025)

If you use data from AlphaMissense in your work, please cite the following paper:

Cheng, J et al. Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science (2023).

AlphaFold Data Provided by GDM:

AlphaFold Data Copyright (2022) DeepMind Technologies Limited.

AlphaMissense Copyright (2023) DeepMind Technologies Limited.

EMBL-EBI training

Recorded webinar

Accessing and interpreting predicted protein structures from AlphaFold database

AlphaFold database (AlphaFold DB) provides open access to over 200 million protein structure predictions to accelerate scientific research. This...

Online tutorial

AlphaFold

Proteins are essential components of life, predicting their 3D structure enables researchers to get an insight into its function and role. AlphaFold...
View all on-demand training