Sentiment-Dynamics-of-AI-Discourse

This repository contains the code, intermediate results, and analysis materials for the study:

Cross-Context Topic Evolution and Sentiment Dynamics of AI Discourse on Weibo and Twitter

The project focuses on AI-related public discourse on Chinese Weibo and English Twitter/X, and studies their topic evolution and sentiment dynamics across time and platforms.

1. Project Overview

With the rapid development of artificial intelligence, discussions on AI applications, governance, ethics, industrial deployment, and public attitudes have expanded rapidly across social media platforms. This project constructs a cross-context analysis framework to compare AI discourse in Chinese and English social media from two aspects:

Topic evolution
Sentiment dynamics

The study uses:

Weibo corpus (Chinese)
Twitter/X corpus (English)

and combines:

DTM
LDA
BERTopic
Sentiment classification models

to analyze how AI-related discussions change over time and across platforms.

2. Research Objective

The main goals of this project are:

To identify the major AI-related topics discussed on Weibo and Twitter/X.
To trace how these topics evolve over quarterly time slices.
To compare topic structures across Chinese and English contexts.
To build sentiment classification models for both corpora.
To analyze the cross-evolution of topic intensity and sentiment polarity.

3. Data Description

Chinese corpus

Platform: Weibo
Keyword: 人工智能
Time span: 2023-01 to 2025-09
Size: approximately 43k posts

English corpus

Platform: Twitter / X
Keyword: AI
Time span: 2023-01 to 2025-09
Size: approximately 33k posts

4. Overall Framework

The overall workflow of this project is shown below.

The framework includes the following main stages:

Data collection
Data preprocessing
Exploratory analysis
Topic modeling and topic analysis
Sentiment classification and model selection
Topic-sentiment cross-evolution analysis

5. Main Methodology

5.1 Topic Modeling

This project uses three topic modeling approaches:

DTM (Dynamic Topic Model)
Used as the main model to capture topic evolution across all quarters in a unified temporal framework.
LDA (Latent Dirichlet Allocation)
Used as a comparative baseline for static quarterly topic modeling.
BERTopic
Used as another comparative baseline to evaluate topic interpretability and consistency.

5.2 Sentiment Classification

Sentiment analysis is conducted separately for Chinese and English corpora. Multiple models are compared, and the best-performing model for each language is selected.

Best Chinese sentiment model: M2
Best English sentiment model: M3

These best models are then used for downstream topic-sentiment cross-analysis.

6. Best Sentiment Models

Chinese sentiment classification

The best-performing model for the Chinese corpus is:

M2

This model is used as the final Chinese sentiment classifier for topic-level sentiment aggregation and longitudinal analysis.

English sentiment classification

The best-performing model for the English corpus is:

M3

This model is used as the final English sentiment classifier for topic-level sentiment aggregation and cross-platform comparison.

7. Environment Requirements

The main package versions used in this project are listed below:

Python: 3.10.18
numpy: 1.26.4
pandas: 2.3.3
scikit-learn: 1.7.2
torch: 2.9.0
transformers: 4.51.2
matplotlib: 3.10.7
seaborn: 0.13.2
gensim: 4.3.3

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
Compare		Compare
EN TOPIC model and sentiment analysis		EN TOPIC model and sentiment analysis
EN compare and could figure		EN compare and could figure
EN dataset		EN dataset
Exploratory Analysis		Exploratory Analysis
Topic model and sentiment analysis		Topic model and sentiment analysis
choose K		choose K
dataset		dataset
sentiment model		sentiment model
.DS_Store		.DS_Store
README.md		README.md
framework .png		framework .png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment-Dynamics-of-AI-Discourse

1. Project Overview

2. Research Objective

3. Data Description

Chinese corpus

English corpus

4. Overall Framework

5. Main Methodology

5.1 Topic Modeling

5.2 Sentiment Classification

6. Best Sentiment Models

Chinese sentiment classification

English sentiment classification

7. Environment Requirements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Sentiment-Dynamics-of-AI-Discourse

1. Project Overview

2. Research Objective

3. Data Description

Chinese corpus

English corpus

4. Overall Framework

5. Main Methodology

5.1 Topic Modeling

5.2 Sentiment Classification

6. Best Sentiment Models

Chinese sentiment classification

English sentiment classification

7. Environment Requirements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages