[Pycon] [new paper] "Parul Sethi" - Visualizing Topic Models

info a pycon.it info a pycon.it
Sab 16 Dic 2017 18:03:16 CET


Title: Visualizing Topic Models
Duration: 45 (includes Q&A)
Q&A Session: 15
Language: en
Type: Talk

Abstract: Topic Modelling is a great way to infer topics in a large corpus of text documents but analyzing them could become difficult without any visualization. The purpose of this talk is to introduce the visualizations that aids the process of training topic models and analyze their results. I’ll give a brief introduction on Topic Models before moving to visualizations.

I’ll demonstrate the steps to train the LDA model in gensim and create the following visualizations using the trained model and how to interpret them:

1. LDA Training visualization: Monitoring the LDA model training using Visdom integration in gensim
2. pyLDAvis: Topic interpretation by utilizing the relevance metric appropriately according to datasets
3. Topic difference visualization: Discovering exact distances between every topic pair with their intersection/difference of terms
4. Dendrogram (with extended heatmap): Topic clustering with a deeper insight into the inter-topic semantic validity 
5. LDA Projections: Document clustering using tensorboard based on the topic representation of documents

Prerequisites: Brief idea of Topic Modeling could be useful but not necessary.


Tags: [u'>nlp </font></font>', u'vertical-align: inherit;', u'>data-analysis </font></font>', u'>pydata </font></font>', u'<font style=', u'><font style=', u'>data-visualization </font></font>', u'>computational-linguistics </font></font>']


Maggiori informazioni sulla lista Pycon