Modules:
interfaces
– Core gensim interfaces
utils
– Various utility functionsFakeDict
InputQueue
RepeatCorpus
SaveLoad
any2unicode()
any2utf8()
chunkize()
chunkize_serial()
copytree_hardlink()
deaccent()
decode_htmlentities()
dict_from_corpus()
file_or_filename()
getNS()
get_max_id()
get_my_ip()
grouper()
identity()
is_corpus()
make_closing()
pickle()
pyro_daemon()
revdict()
simple_preprocess()
synchronous()
to_unicode()
to_utf8()
tokenize()
toptexts()
unpickle()
upload_chunked()
matutils
– Math utils
corpora.bleicorpus
– Corpus in Blei’s LDA-C formatcorpora.dictionary
– Construct word<->id mappingscorpora.hashdictionary
– Construct word<->id mappingscorpora.lowcorpus
– Corpus in List-of-Words format
corpora.mmcorpus
– Corpus in Matrix Market format
corpora.svmlightcorpus
– Corpus in SVMlight format
corpora.wikicorpus
– Corpus from a Wikipedia dumpcorpora.textcorpus
– Building corpora with dictionariescorpora.ucicorpus
– Corpus in UCI bag-of-words formatcorpora.indexedcorpus
– Random access to corpus documents
models.ldamodel
– Latent Dirichlet Allocationmodels.ldamallet
– Latent Dirichlet Allocation via Malletmodels.lsimodel
– Latent Semantic Indexingmodels.tfidfmodel
– TF-IDF modelmodels.rpmodel
– Random Projectionsmodels.hdpmodel
– Hierarchical Dirichlet Processmodels.logentropy_model
– LogEntropy modelmodels.lsi_dispatcher
– Dispatcher for distributed LSImodels.lsi_worker
– Worker for distributed LSImodels.lda_dispatcher
– Dispatcher for distributed LDAmodels.lda_worker
– Worker for distributed LDAmodels.word2vec
– Deep learning with word2vecsimilarities.docsim
– Document similarity queriessimserver
– Document similarity server