Features


Linguistic Inquiry and Word Count

The following list details the network and LIWC attributes utilized within this dataset. For some LIWC attributes, hyperlinks are provided to surviving Reddit posts that demonstrate the highest scoring instances for each of them.

Network

  • Source subreddit
  • Target subreddit
  • Post-ID
  • Timestamp
  • Link sentiment

Sentiment

Social Processes

  • LIWC_Social
  • LIWC_Family
  • LIWC_Friends
  • LIWC_Humans

Affective Processes

  • LIWC_Affect
  • LIWC_Posemo
  • LIWC_Negemo
  • LIWC_Anx
  • LIWC_Anger
  • LIWC_Sad

Cognitive Processes

  • LIWC_CogMech
  • LIWC_Insight
  • LIWC_Cause
  • LIWC_Discrep
  • LIWC_Tentat
  • LIWC_Certain
  • LIWC_Inhib
  • LIWC_Incl
  • LIWC_Excl

Perceptual Processes

  • LIWC_Percept
  • LIWC_See
  • LIWC_Hear = 0.238
  • LIWC_Feel

Biological Processes

Relativity & Space-Time

Personal Concerns

  • LIWC_Work
  • LIWC_Achiev
  • LIWC_Leisure
  • LIWC_Home
  • LIWC_Money
  • LIWC_Relig
  • LIWC_Death

Spoken Language Markers

  • LIWC_Assent
  • LIWC_Dissent
  • LIWC_Nonflu
  • LIWC_Filler

Linguistic Dimensions

  • LIWC_Funct
  • LIWC_Pronoun
  • LIWC_Ppron
  • LIWC_I
  • LIWC_We
  • LIWC_You
  • LIWC_SheHe
  • LIWC_They
  • LIWC_Ipron
  • LIWC_Article
  • LIWC_Verbs
  • LIWC_AuxVb
  • LIWC_Past
  • LIWC_Present
  • LIWC_Future
  • LIWC_Adverbs
  • LIWC_Prep
  • LIWC_Conj
  • LIWC_Negate
  • LIWC_Quant
  • LIWC_Numbers
  • LIWC_Swear

Surface-Level Textual Features

  • Number of characters
  • Number of characters without counting white space
  • Fraction of alphabetical characters
  • Fraction of digits
  • Fraction of uppercase characters
  • Fraction of white spaces
  • Fraction of special characters, such as comma, exclamation mark, etc.
  • Number of words
  • Number of unique works
  • Number of long words (at least 6 characters)
  • Average word length
  • Number of unique stopwords
  • Fraction of stopwords
  • Number of sentences
  • Number of long sentences (at least 10 words)
  • Average number of characters per sentence
  • Average number of words per sentence
  • Automated readability index