getAccessions {geneHummus} | R Documentation |
The getAccessions function parses the protein page for each identifier
and extracts the accession id (usually referred to as the XP accession in the
RefSeq database) and the organism, given by its scientific name.
The accessions_by_spp and accessions_from_spp functions are convenient
filters for further cleaning of the getAccessions output: they give the total
number of XP accessions per species and extract the XP accessions for a given
species, respectively.
getAccessions(protein_ids)
protein_ids | A character vector containing protein identifiers. |
A data.frame of protein ids including the columns:
accession
organism
Jose V. Die
accessions_by_spp to summarize the total number of protein accessions per species.
accessions_from_spp to filter the accession ids for a given species.
prot_ids <- c("593705262", "1379669790", "357520645", "1150156484")
getAccessions(prot_ids)
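
The per-species summary and the single-species filter described above can be illustrated with base R on a data.frame shaped like the documented return value. This is only a sketch: the `accessions` object and the accession strings in it are invented placeholders, not real output of getAccessions, and the code does not call accessions_by_spp or accessions_from_spp, whose exact signatures are not given on this page.

```r
# Hypothetical stand-in for the data.frame returned by getAccessions();
# the accession values below are placeholders invented for illustration.
accessions <- data.frame(
  accession = c("XP_000000001.1", "XP_000000002.1", "XP_000000003.1"),
  organism  = c("Cicer arietinum", "Medicago truncatula", "Cicer arietinum"),
  stringsAsFactors = FALSE
)

# Total number of accessions per species, as a two-column data.frame
# (organism, n), built with base R's table().
per_species <- as.data.frame(
  table(organism = accessions$organism),
  responseName = "n",
  stringsAsFactors = FALSE
)

# Accession ids belonging to a single species.
cicer <- accessions$accession[accessions$organism == "Cicer arietinum"]
```

With real data, the same two steps would presumably be done through accessions_by_spp and accessions_from_spp; the base R version only shows what kind of result to expect from each.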