
About NewsMonitor

A Semantic Feed Aggregator

Created by Danny Ayers

Source code: github.com/danja/transmissions

Overview

NewsMonitor is a feed aggregator application built on the Transmissions message processing framework. It subscribes to RSS, Atom, and RDF feeds, stores their content in a SPARQL triple store, and provides a web interface for browsing aggregated posts.

Core Functionality

The application performs three primary functions:

Architecture

NewsMonitor consists of several components:

Data is stored in two named graphs:

Feed Processing Pipeline

When subscribing to a feed, NewsMonitor executes a pipeline of processors:

  1. HTTP client fetches the feed XML
  2. Feed parser extracts individual entries
  3. Deduplicator checks for existing entries using GUIDs and content hashes
  4. RDF builder converts entries to RDF triples using Nunjucks templates
  5. SPARQL updater inserts new entries into the triple store
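The deduplication step (step 3) can be sketched as follows. This is a minimal illustration, not NewsMonitor's actual processor: the function names, the dictionary keys (`guid`, `title`, `content`), the use of SHA-256, and the in-memory sets standing in for a lookup against the triple store are all assumptions made for the example.

```python
import hashlib


def content_hash(title, content):
    """Return a SHA-256 digest of the title and body, ignoring whitespace differences."""
    normalised = " ".join(f"{title} {content}".split())
    return hashlib.sha256(normalised.encode("utf-8")).hexdigest()


def filter_new_entries(entries, seen_guids, seen_hashes):
    """Keep only entries whose GUID and content hash have not been seen before.

    `seen_guids` and `seen_hashes` are plain sets here (an assumption); a real
    deployment would more likely check the store itself.
    """
    new_entries = []
    for entry in entries:
        guid = entry.get("guid")
        digest = content_hash(entry.get("title", ""), entry.get("content", ""))
        # An entry is a duplicate if either its GUID or its content is already known.
        if (guid and guid in seen_guids) or digest in seen_hashes:
            continue
        if guid:
            seen_guids.add(guid)
        seen_hashes.add(digest)
        new_entries.append(entry)
    return new_entries
```

Checking both keys covers two cases a single key would miss: a feed that republishes the same item under a new GUID, and a feed that omits GUIDs altogether.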

Updates run automatically every hour by default, with the interval configurable via environment variables.
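As an illustration of how such an interval setting might be read, the sketch below falls back to the one-hour default when the variable is missing or invalid. The variable name `UPDATE_INTERVAL_SECONDS` and the choice of seconds as the unit are hypothetical; this page does not name the actual environment variables.

```python
import os

DEFAULT_INTERVAL_SECONDS = 3600  # one hour, the default stated above


def update_interval_seconds(env=None, var_name="UPDATE_INTERVAL_SECONDS"):
    """Read the update interval from the environment.

    `var_name` is a hypothetical variable name used for illustration only.
    Missing, non-numeric, or non-positive values fall back to the default.
    """
    env = os.environ if env is None else env
    raw = env.get(var_name)
    if raw is None:
        return DEFAULT_INTERVAL_SECONDS
    try:
        value = int(raw)
    except ValueError:
        return DEFAULT_INTERVAL_SECONDS
    return value if value > 0 else DEFAULT_INTERVAL_SECONDS
```

Passing a plain dictionary as `env` makes the function easy to exercise without touching the real process environment, for example `update_interval_seconds({"UPDATE_INTERVAL_SECONDS": "900"})`.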

Data Model

NewsMonitor uses the SIOC (Semantically-Interlinked Online Communities) vocabulary for representing feeds and posts:

This RDF-based storage enables SPARQL queries for flexible content retrieval and integration with other semantic web applications.
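To give a sense of what such a query could look like, the sketch below builds a SPARQL SELECT for the most recent posts in a named graph. The SIOC and Dublin Core Terms namespaces are the published ones, but the specific properties (`dcterms:title`, `dcterms:created`), the helper name, and the graph IRI in the usage note are assumptions; NewsMonitor's actual graph names and property choices are not listed on this page.

```python
SIOC = "http://rdfs.org/sioc/ns#"
DCTERMS = "http://purl.org/dc/terms/"


def recent_posts_query(graph_iri, limit=20):
    """Build a SPARQL query listing the newest sioc:Post resources in one graph.

    The title and date properties are assumptions for illustration.
    """
    if not isinstance(limit, int) or limit <= 0:
        raise ValueError("limit must be a positive integer")
    # Reject characters that are not allowed inside a SPARQL IRI reference.
    if any(ch in graph_iri for ch in '<>"{}|^`\\ \n\t'):
        raise ValueError("graph_iri contains characters not allowed in an IRI")
    return f"""PREFIX sioc: <{SIOC}>
PREFIX dcterms: <{DCTERMS}>

SELECT ?post ?title ?created
WHERE {{
  GRAPH <{graph_iri}> {{
    ?post a sioc:Post ;
          dcterms:title ?title ;
          dcterms:created ?created .
  }}
}}
ORDER BY DESC(?created)
LIMIT {limit}"""
```

For example, `recent_posts_query("http://example.org/graph/content", limit=10)` returns a query string that can be sent to any SPARQL 1.1 endpoint; the graph IRI shown is a placeholder.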

Features

Technical Stack

Part of the Transmissions Framework

Learn more at github.com/danja/transmissions