This vignette describes all the columns of data (aka return fields)
you can request from UniProt’s UniProtKB database. This is relevant for
uniprot_map() when mapping to UniProtKB and for
uniprot_search() when database = "uniprotkb".
These tables have been scraped from this page on the
UniProt website.
- 
field = string to use in the fieldsargument
- 
label = name of column that will be returned
Names & Taxonomy
| accession | Entry | 
| id | Entry Name | 
| gene_names | Gene Names | 
| gene_primary | Gene Names (primary) | 
| gene_synonym | Gene Names (synonym) | 
| gene_oln | Gene Names (ordered locus) | 
| gene_orf | Gene Names (ORF) | 
| organism_name | Organism | 
| organism_id | Organism ID | 
| protein_name | Protein names | 
| xref_proteomes | Proteomes | 
| lineage | Taxonomic lineage | 
| lineage_ids | Taxonomic lineage (IDs) | 
| virus_hosts | Virus hosts | 
 
Sequences
| cc_alternative_products | Alternative products | 
| ft_var_seq | Alternative sequence | 
| error_gmodel_pred | Erroneous gene model prediction | 
| fragment | Fragment | 
| organelle | Gene encoded by | 
| length | Length | 
| mass | Mass | 
| cc_mass_spectrometry | Mass spectrometry | 
| ft_variant | Natural variant | 
| ft_non_cons | Non-adjacent residues | 
| ft_non_std | Non-standard residue | 
| ft_non_ter | Non-terminal residue | 
| cc_polymorphism | Polymorphism | 
| cc_rna_editing | RNA editing | 
| sequence | Sequence | 
| cc_sequence_caution | Sequence caution | 
| ft_conflict | Sequence conflict | 
| ft_unsure | Sequence uncertainty | 
| sequence_version | Sequence version | 
 
Function
| absorption | Absorption | 
| ft_act_site | Active site | 
| cc_activity_regulation | Activity regulation | 
| ft_binding | Binding site | 
| cc_catalytic_activity | Catalytic activity | 
| cc_cofactor | Cofactor | 
| ft_dna_bind | DNA binding | 
| ec | EC number | 
| cc_function | Function [CC] | 
| kinetics | Kinetics | 
| cc_pathway | Pathway | 
| ph_dependence | pH dependence | 
| redox_potential | Redox potential | 
| rhea | Rhea ID | 
| ft_site | Site | 
| temp_dependence | Temperature dependence | 
 
Miscellaneous
| annotation_score | Annotation | 
| cc_caution | Caution | 
| comment_count | Comment Count | 
| feature_count | Features | 
| keywordid | Keyword ID | 
| keyword | Keywords | 
| cc_miscellaneous | Miscellaneous [CC] | 
| protein_existence | Protein existence | 
| reviewed | Reviewed | 
| tools | Tools | 
| uniparc_id | UniParc | 
 
Interaction
| cc_interaction | Interacts with | 
| cc_subunit | Subunit structure[CC] | 
 
Expression
| cc_developmental_stage | Developmental stage | 
| cc_induction | Induction | 
| cc_tissue_specificity | Tissue specificity | 
 
Gene Ontology (GO)
| go_p | Gene ontology (biological process) | 
| go_c | Gene ontology (cellular component) | 
| go | Gene ontology (GO) | 
| go_f | Gene ontology (molecular function) | 
| go_id | Gene ontology IDs | 
 
Pathology & Biotech
| cc_allergen | Allergenic properties | 
| cc_biotechnology | Biotechnological use | 
| cc_disruption_phenotype | Disruption phenotype | 
| cc_disease | Involvement in disease | 
| ft_mutagen | Mutagenesis | 
| cc_pharmaceutical | Pharmaceutical use | 
| cc_toxic_dose | Toxic dose | 
 
Subcellular location
| ft_intramem | Intramembrane | 
| cc_subcellular_location | Subcellular location[CC] | 
| ft_topo_dom | Topological domain | 
| ft_transmem | Transmembrane | 
 
PTM / Processsing
| ft_chain | Chain | 
| ft_crosslnk | Cross-link | 
| ft_disulfid | Disulfide bond | 
| ft_carbohyd | Glycosylation | 
| ft_init_met | Initiator methionine | 
| ft_lipid | Lipidation | 
| ft_mod_res | Modified residue | 
| ft_peptide | Peptide | 
| cc_ptm | Post-translational modification | 
| ft_propep | Propeptide | 
| ft_signal | Signal peptide | 
| ft_transit | Transit peptide | 
 
Structure
| structure_3d | 3D | 
| ft_strand | Beta strand | 
| ft_helix | Helix | 
| ft_turn | Turn | 
 
Date of
| date_created | Date of creation | 
| date_modified | Date of last modification | 
| date_sequence_modified | Date of last sequence modification | 
| version | Entry version | 
 
Family & Domains
| ft_coiled | Coiled coil | 
| ft_compbias | Compositional bias | 
| cc_domain | Domain[CC] | 
| ft_domain | Domain[FT] | 
| ft_motif | Motif | 
| protein_families | Protein families | 
| ft_region | Region | 
| ft_repeat | Repeat | 
| ft_zn_fing | Zinc finger | 
 
Sequence Databases
| xref_ccds | CCDS | 
| xref_embl | EMBL | 
| xref_pir | PIR | 
| xref_refseq | RefSeq | 
 
3D Structure Databases
| xref_alphafolddb | AlphaFoldDB | 
| xref_bmrb | BMRB | 
| xref_pcddb | PCDDB | 
| xref_pdb | PDB | 
| xref_pdbsum | PDBsum | 
| xref_sasbdb | SASBDB | 
| xref_smr | SMR | 
 
Protein-Protein Interaction Databases
| xref_biogrid | BioGRID | 
| xref_corum | CORUM | 
| xref_complexportal | ComplexPortal | 
| xref_dip | DIP | 
| xref_elm | ELM | 
| xref_intact | IntAct | 
| xref_mint | MINT | 
| xref_string | STRING | 
 
Chemistry Databases
| xref_bindingdb | BindingDB | 
| xref_chembl | ChEMBL | 
| xref_drugbank | DrugBank | 
| xref_drugcentral | DrugCentral | 
| xref_guidetopharmacology | GuidetoPHARMACOLOGY | 
| xref_swisslipids | SwissLipids | 
 
Protein Family/Group Databases
| xref_allergome | Allergome | 
| xref_cazy | CAZy | 
| xref_clae | CLAE | 
| xref_esther | ESTHER | 
| xref_imgt_gene-db | IMGT_GENE-DB | 
| xref_merops | MEROPS | 
| xref_moondb | MoonDB | 
| xref_moonprot | MoonProt | 
| xref_peroxibase | PeroxiBase | 
| xref_rebase | REBASE | 
| xref_tcdb | TCDB | 
| xref_unilectin | UniLectin | 
 
PTM Databases
| xref_carbonyldb | CarbonylDB | 
| xref_depod | DEPOD | 
| xref_glycosmos | GlyCosmos | 
| xref_glyconnect | GlyConnect | 
| xref_glygen | GlyGen | 
| xref_metosite | MetOSite | 
| xref_phosphositeplus | PhosphoSitePlus | 
| xref_swisspalm | SwissPalm | 
| xref_iptmnet | iPTMnet | 
 
Genetic Variation Databases
| xref_biomuta | BioMuta | 
| xref_dmdm | DMDM | 
| xref_dbsnp | dbSNP | 
 
2D Gel Databases
| xref_compluyeast-2dpage | COMPLUYEAST-2DPAGE | 
| xref_dosac-cobs-2dpage | DOSAC-COBS-2DPAGE | 
| xref_ogp | OGP | 
| xref_reproduction-2dpage | REPRODUCTION-2DPAGE | 
| xref_swiss-2dpage | SWISS-2DPAGE | 
| xref_ucd-2dpage | UCD-2DPAGE | 
| xref_world-2dpage | World-2DPAGE | 
 
Proteomic Databases
| xref_cptac | CPTAC | 
| xref_epd | EPD | 
| xref_massive | MassIVE | 
| xref_maxqb | MaxQB | 
| xref_pride | PRIDE | 
| xref_paxdb | PaxDb | 
| xref_peptideatlas | PeptideAtlas | 
| xref_promex | ProMEX | 
| xref_proteomicsdb | ProteomicsDB | 
| xref_topdownproteomics | TopDownProteomics | 
| xref_jpost | jPOST | 
 
Protocols And Materials Databases
| xref_abcd | ABCD | 
| xref_antibodypedia | Antibodypedia | 
| xref_cptc | CPTC | 
| xref_dnasu | DNASU | 
 
Genome Annotation Databases
| xref_ensembl | Ensembl | 
| xref_ensemblbacteria | EnsemblBacteria | 
| xref_ensemblfungi | EnsemblFungi | 
| xref_ensemblmetazoa | EnsemblMetazoa | 
| xref_ensemblplants | EnsemblPlants | 
| xref_ensemblprotists | EnsemblProtists | 
| xref_geneid | GeneID | 
| xref_gramene | Gramene | 
| xref_kegg | KEGG | 
| xref_mane-select | MANE-Select | 
| xref_patric | PATRIC | 
| xref_ucsc | UCSC | 
| xref_vectorbase | VectorBase | 
| xref_wbparasite | WBParaSite | 
| xref_wbparasitetranscriptprotein | WBParaSiteTranscriptProtein | 
 
Organism-Specific Databases
| xref_agr | AGR | 
| xref_arachnoserver | ArachnoServer | 
| xref_araport | Araport | 
| xref_cgd | CGD | 
| xref_ctd | CTD | 
| xref_conoserver | ConoServer | 
| xref_disgenet | DisGeNET | 
| xref_echobase | EchoBASE | 
| xref_flybase | FlyBase | 
| xref_genecards | GeneCards | 
| xref_genereviews | GeneReviews | 
| xref_hgnc | HGNC | 
| xref_hpa | HPA | 
| xref_legiolist | LegioList | 
| xref_leproma | Leproma | 
| xref_mgi | MGI | 
| xref_mim | MIM | 
| xref_maizegdb | MaizeGDB | 
| xref_malacards | MalaCards | 
| xref_niagads | NIAGADS | 
| xref_opentargets | OpenTargets | 
| xref_orphanet | Orphanet | 
| xref_pharmgkb | PharmGKB | 
| xref_pombase | PomBase | 
| xref_pseudocap | PseudoCAP | 
| xref_rgd | RGD | 
| xref_sgd | SGD | 
| xref_tair | TAIR | 
| xref_tuberculist | TubercuList | 
| xref_veupathdb | VEuPathDB | 
| xref_vgnc | VGNC | 
| xref_wormbase | WormBase | 
| xref_xenbase | Xenbase | 
| xref_zfin | ZFIN | 
| xref_dictybase | dictyBase | 
| xref_euhcvdb | euHCVdb | 
| xref_nextprot | neXtProt | 
 
Phylogenomic Databases
| xref_genetree | GeneTree | 
| xref_hogenom | HOGENOM | 
| xref_inparanoid | InParanoid | 
| xref_ko | KO | 
| xref_oma | OMA | 
| xref_orthodb | OrthoDB | 
| xref_phylomedb | PhylomeDB | 
| xref_treefam | TreeFam | 
| xref_eggnog | eggNOG | 
 
Enzyme And Pathway Databases
| xref_brenda | BRENDA | 
| xref_biocyc | BioCyc | 
| xref_pathwaycommons | PathwayCommons | 
| xref_plantreactome | PlantReactome | 
| xref_reactome | Reactome | 
| xref_sabio-rk | SABIO-RK | 
| xref_signor | SIGNOR | 
| xref_signalink | SignaLink | 
| xref_unipathway | UniPathway | 
 
Miscellaneous Databases
| xref_biogrid-orcs | BioGRID-ORCS | 
| xref_chitars | ChiTaRS | 
| xref_evolutionarytrace | EvolutionaryTrace | 
| xref_genewiki | GeneWiki | 
| xref_genomernai | GenomeRNAi | 
| xref_phi-base | PHI-base | 
| xref_pro | PRO | 
| xref_pharos | Pharos | 
| xref_rnact | RNAct | 
 
Gene Expression Databases
| xref_bgee | Bgee | 
| xref_cleanex | CleanEx | 
| xref_collectf | CollecTF | 
| xref_expressionatlas | ExpressionAtlas | 
| xref_genevisible | Genevisible | 
 
Family And Domain Databases
| xref_cdd | CDD | 
| xref_disprot | DisProt | 
| xref_gene3d | Gene3D | 
| xref_hamap | HAMAP | 
| xref_ideal | IDEAL | 
| xref_interpro | InterPro | 
| xref_panther | PANTHER | 
| xref_pirsf | PIRSF | 
| xref_prints | PRINTS | 
| xref_prosite | PROSITE | 
| xref_pfam | Pfam | 
| xref_prodom | ProDom | 
| xref_sfld | SFLD | 
| xref_smart | SMART | 
| xref_supfam | SUPFAM | 
| xref_tigrfams | TIGRFAMs |