IPRI News Analyzer (or IpriNA) is a multi-agent tool for quantitative measure that was initially developed for the ANR project IPRI. Its purpose is fourfold:
Ipri News Analyzer is only available in french (which means both that the GUI language is french and that language processing assumes that french resources are collected).
(in-depth discussion on the subject can be found on the project site)
The focus of the IPRI project was to measure the influence of the online medium on pluralism versus redundancy. Two common conception are indeed either that online press is more plural (since free speech on the Internet must apply to news on the Net) and that online press is more redundant (because of the ease of plagiary). There had been several studies before, but they were qualitative, while IPRI aimed at being quantitative.
The quantitative analysis of the online press was done by monitoring the rss feeds of most general news providers, in most areas (newspaper websites, pure players, blogs and so on). IPRI news analyzer was the tool that was developed to manage the corpus of rss feeds, monitor the production of these feeds, and analyze their content.
The results of the study (covering much more than just what has been measured using IPRI News Analyzer) is available on the project site. This page is about the software.
IPRI News Analyzer has a few dependencies, such as a MySQL database and the TreeTagger software (which itself needs a Perl interpreter). An installation guide is available (if somewhat outdated).
Corpus definition, feeds item collection and analysis can be launched either separately, or simultaneously.
Corpus
application lets you define your own tags (within some limits) and more importantly your own media sources and the feeds you follow for each of them (you will then tag the feeds).Collection
lets you define what to collect and when, and whether to process the language in real time or (if collection is already too resource-intensive) in batch mode. You can then oversee the collection and get hints on why collecting some feeds may not work.Log
component lets you oversee the collection and memory usage, and gather stats on what has been stored in the database.Analysis
component lets you group news articles into subjects. Automated classification processes include using Google News subjects, and string matching-based supervised classification and clustering, and automated classification results must be manually validated. Manual classification is possible as well.Analysis
component then computes various statistics about the news subjects in relation to the feeds and their tags.(Note: the data provided as example here are partial and certainly do not accurately represent the corresponding media sources. If you wish accurate data, try it yourself!)
IPRI News Analyzer is available under the GNU GPL v3.
All related documentation is available under the CC Attribution-Noncommercial-Share Alike 3.0 Unported.
Contributors: Cyril Laitang, Elöd Egyed-Zsigmond, Samuel Gesche
Release date: 2013-10-21 (slight refactoring from the 2011 release)
An installation guide and a user guide (both in French) can be found on the project site.