Conversation Analyzer - An Introduction

In this context, a conversation is simply a textual interaction between two or more participants (or senders).

Basic Length Stats

A heatmap can provide an immediate view of how a conversation’s features change through time. Here we can see the total length of messages over a 10-month period.
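The values behind such a heatmap are totals of message length grouped by time. Below is a minimal sketch of that grouping, assuming each message is a `(timestamp, sender, text)` tuple and that length is measured in characters; the function name and the data layout are illustrative assumptions, not taken from the analyzer itself.

```python
from collections import defaultdict
from datetime import datetime


def total_length_by_day(messages):
    """Sum message length (in characters) for each calendar day.

    `messages` is an iterable of (timestamp, sender, text) tuples;
    this layout is an assumption made for the example.
    """
    totals = defaultdict(int)
    for timestamp, _sender, text in messages:
        totals[timestamp.date()] += len(text)
    return dict(totals)


# Hypothetical sample data
messages = [
    (datetime(2016, 1, 1, 9, 30), "alice", "hello"),
    (datetime(2016, 1, 1, 9, 31), "bob", "hi there"),
    (datetime(2016, 1, 2, 18, 0), "alice", "bye"),
]
daily_totals = total_length_by_day(messages)
```

Each day's total can then be placed in a calendar-style grid and colored by value to obtain the heatmap.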

Interval Stats

Aggregation

Lexical Stats

Lexical richness variation by year, aggregated by month
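As a rough illustration of how such a value could be computed, the sketch below assumes lexical richness means the type-token ratio (distinct words divided by total words) and uses simple whitespace tokenization. Both choices are assumptions made for this example and may differ from the measure used for the plot.

```python
from collections import defaultdict
from datetime import datetime


def lexical_richness(texts):
    """Type-token ratio: distinct words / total words (assumed definition)."""
    tokens = [word for text in texts for word in text.lower().split()]
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def richness_by_month(messages):
    """Group (timestamp, text) pairs by (year, month) and score each group."""
    by_month = defaultdict(list)
    for timestamp, text in messages:
        by_month[(timestamp.year, timestamp.month)].append(text)
    return {month: lexical_richness(texts) for month, texts in by_month.items()}


# Hypothetical sample data
sample = [
    (datetime(2015, 3, 1), "a b a"),
    (datetime(2015, 3, 9), "b c"),
    (datetime(2015, 4, 2), "x y"),
]
monthly = richness_by_month(sample)
```

Comparing the same month across years then only requires splitting the keys by year.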

Word Frequency

Sender-specific frequency of three example words, aggregated by hour
  • TF (term frequency) = sender w-count / total w-count, for a word w
  • IDF (inverse document frequency) = log(number of participants / number of participants who used w)
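The two definitions above can be sketched in a few lines. This example reads "total w-count" as the count of w across all senders, and combines the two terms as a product, as in standard TF-IDF; both points are interpretations rather than something stated in the text, and the function name and data layout are hypothetical.

```python
import math
from collections import Counter


def tf_idf(word, sender, counts_by_sender):
    """Sender-specific TF-IDF score for `word`.

    `counts_by_sender` maps each sender to a Counter of word counts
    (an assumed layout for this example).
    """
    total = sum(counts[word] for counts in counts_by_sender.values())
    if total == 0:
        return 0.0  # nobody used the word
    # TF: share of all uses of the word that belong to this sender
    tf = counts_by_sender[sender][word] / total
    # IDF: log(participants / participants who used the word)
    users = sum(1 for counts in counts_by_sender.values() if counts[word] > 0)
    idf = math.log(len(counts_by_sender) / users)
    return tf * idf


# Hypothetical sample data
counts = {
    "alice": Counter("pizza pizza hello".split()),
    "bob": Counter("hello".split()),
    "carol": Counter("hello pizza".split()),
}
pizza_score = tf_idf("pizza", "alice", counts)
hello_score = tf_idf("hello", "alice", counts)
```

A word used by every participant gets an IDF of log(1) = 0, so it scores zero for everyone, while a word concentrated in a few senders scores higher for them.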
Boxplot showing the usage by sender (word count) of a specific word over a 10-month period
Frequency-count plots (1) for the top 20 words and (2) for all words, on a log-log scale

Emoticon Stats

Conclusions

Data Scientist @ Zalando Dublin - Machine Learning, Computer Vision and Everything Generative ❤

5agado
