Page cover

Data Ingestion

  • Twitter API Integration: Uses the Twitter/X API to fetch up to 50 recent tweets, replies, and engagement data (likes, retweets, comments).

  • Data Preprocessing: Cleans and structures raw tweet data for analysis, removing noise (e.g., irrelevant mentions).

Tech Stack:

  • Twitter API v2: For real-time data access.

  • Python (Tweepy): For API interaction and data extraction.

  • Apache Kafka: For streaming tweet data in high-volume scenarios.

Last updated