accessions_by_spp {geneHummus} | R Documentation |
Summarizes a dataframe of protein ids and return the total number of accessions per organism.
accessions_by_spp(my_accessions)
my_accessions |
A data frame with accession protein ids and organisms |
A data.frame
of summarized results including columns:
organism, taxonomic species
N.seqs, total number of sequences
Jose V. Die
getAccessions
to create the data frame with acession
id and organism for each protein identifier.
my_prots = data.frame(accession = c("XP_014620925", "XP_003546066",
"XP_025640041", "XP_019453956", "XP_006584791", "XP_020212415",
"XP_017436622", "XP_004503803", "XP_019463844"),
organism = c("Glycine max", "Glycine max", "Arachis hypogaea",
"Lupinus angustifolius", "Glycine max", "Cajanus cajan",
"Vigna angularis", "Cicer arietinum", "Lupinus angustifolius"))
accessions_by_spp(my_prots)