[Pycon] [new paper] "Parul Sethi" - Visualizing Topic Models
info a pycon.it
info a pycon.it
Sab 16 Dic 2017 18:03:16 CET
Title: Visualizing Topic Models
Duration: 45 (includes Q&A)
Q&A Session: 15
Language: en
Type: Talk
Abstract: Topic Modelling is a great way to infer topics in a large corpus of text documents but analyzing them could become difficult without any visualization. The purpose of this talk is to introduce the visualizations that aids the process of training topic models and analyze their results. I’ll give a brief introduction on Topic Models before moving to visualizations.
I’ll demonstrate the steps to train the LDA model in gensim and create the following visualizations using the trained model and how to interpret them:
1. LDA Training visualization: Monitoring the LDA model training using Visdom integration in gensim
2. pyLDAvis: Topic interpretation by utilizing the relevance metric appropriately according to datasets
3. Topic difference visualization: Discovering exact distances between every topic pair with their intersection/difference of terms
4. Dendrogram (with extended heatmap): Topic clustering with a deeper insight into the inter-topic semantic validity
5. LDA Projections: Document clustering using tensorboard based on the topic representation of documents
Prerequisites: Brief idea of Topic Modeling could be useful but not necessary.
Tags: [u'>nlp </font></font>', u'vertical-align: inherit;', u'>data-analysis </font></font>', u'>pydata </font></font>', u'<font style=', u'><font style=', u'>data-visualization </font></font>', u'>computational-linguistics </font></font>']
Maggiori informazioni sulla lista
Pycon