Live
2023 – 2024

Real-Time Financial News Sentiment Crawler

Asynchronous pipeline that ingests finance headlines and scores their sentiment

Role

Creator / Backend Engineer

Period

2023 – 2024

Category

infra

Overview

A resilient async pipeline that collects financial news from multiple RSS feeds (Reuters, CNBC), runs real-time sentiment analysis with Hugging Face Transformers, and exports clean JSON feeds for quant workflows. It features graceful fallbacks, robust XML/HTML parsing, persistent SQLAlchemy storage, and a pytest test suite.
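As a rough illustration of the "graceful fallback" idea described above, the sketch below fetches several feeds concurrently and keeps whatever succeeds. It is not the project's actual code: the names `parse_titles`, `fetch_feed`, and `collect_headlines` are hypothetical, and the `fetch` callable is an assumed stand-in for the real HTTP call (in an aiohttp-based pipeline, something like `session.get(url)` followed by `await resp.text()`).

```python
import asyncio
import xml.etree.ElementTree as ET
from typing import Awaitable, Callable, Iterable

# Hypothetical type: any coroutine function that maps a URL to the feed body.
Fetcher = Callable[[str], Awaitable[str]]


def parse_titles(xml_text: str) -> list[str]:
    """Extract non-empty <item><title> values from an RSS 2.0 document."""
    root = ET.fromstring(xml_text)
    titles = []
    for item in root.iter("item"):
        title = item.findtext("title", default="").strip()
        if title:
            titles.append(title)
    return titles


async def fetch_feed(url: str, fetch: Fetcher, timeout: float = 5.0) -> list[str]:
    """Fetch one feed; fall back to an empty list on timeout, I/O, or parse errors."""
    try:
        body = await asyncio.wait_for(fetch(url), timeout)
        return parse_titles(body)
    except (asyncio.TimeoutError, OSError, ET.ParseError):
        # Assumption: a failed source is skipped rather than aborting the run.
        return []


async def collect_headlines(urls: Iterable[str], fetch: Fetcher) -> list[str]:
    """Fetch all feeds concurrently and merge the headlines that were recovered."""
    results = await asyncio.gather(*(fetch_feed(u, fetch) for u in urls))
    return [title for feed in results for title in feed]
```

Passing the fetcher in as a parameter keeps the fallback logic independent of the HTTP client, so it can be exercised in tests with a fake coroutine instead of live feeds.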

Key Highlights

  • Resilient async RSS ingestion with graceful fallbacks
  • Transformer-based sentiment analysis with lazy loading
  • Robust parsers handling XML/HTML inconsistencies across sources
  • Persistent SQLAlchemy ORM storage with clean PostgreSQL upgrade path
  • JSON feed export for dashboard, n8n, and trading bot integration
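Two of the highlights above, lazy model loading and JSON feed export, can be sketched together. This is a minimal illustration under stated assumptions, not the project's implementation: `LazySentiment`, `to_json_feed`, and the feed layout (`count` / `items`) are hypothetical, and the default loader assumes the standard `transformers.pipeline("sentiment-analysis")` entry point with its default model.

```python
import json
from typing import Callable, Optional, Sequence


def _load_transformers_pipeline() -> Callable:
    """Default loader. The heavy import is deferred until the model is needed."""
    from transformers import pipeline  # imported lazily on purpose

    return pipeline("sentiment-analysis")


class LazySentiment:
    """Scores headlines, building the underlying model only on first use."""

    def __init__(self, loader: Optional[Callable[[], Callable]] = None):
        # `loader` is a hypothetical injection point, useful for tests.
        self._loader = loader or _load_transformers_pipeline
        self._model: Optional[Callable] = None

    def score(self, headlines: Sequence[str]) -> list[dict]:
        if not headlines:
            return []  # nothing to score, so the model is never loaded
        if self._model is None:
            self._model = self._loader()
        results = self._model(list(headlines))
        return [
            {"headline": h, "label": r["label"], "score": float(r["score"])}
            for h, r in zip(headlines, results)
        ]


def to_json_feed(records: list[dict]) -> str:
    """Serialize scored headlines into a small JSON document (assumed layout)."""
    return json.dumps({"count": len(records), "items": records}, ensure_ascii=False)
```

With this shape, importing the module and constructing `LazySentiment()` stay cheap; the model is only built on the first non-empty call to `score`.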

Tech Stack

Python
aiohttp
SQLAlchemy
PostgreSQL
Hugging Face Transformers
Pydantic
pytest

What I Learned

This project deepened my understanding of asynchronous Python with aiohttp and of designing pipelines that keep running when a source fails. It also gave me hands-on experience with production concerns such as lazy model loading, persistent storage, and automated testing.
