Towards A Twitter Observatory: A Multi-Paradigm Framework For Collecting, Storing And Analysing Tweets

Affiliation auteurs!!!! Error affiliation !!!!
TitreTowards A Twitter Observatory: A Multi-Paradigm Framework For Collecting, Storing And Analysing Tweets
Type de publicationConference Paper
Year of Publication2016
AuteursBasaille I, Kirgizov S, Leclercq E, Savonnet M, Cullot N
EditorEspana S, Ralyte J, Souveyet C
Conference Name2016 IEEE TENTH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS)
PublisherIEEE
Conference Location345 E 47TH ST, NEW YORK, NY 10017 USA
ISBN Number978-1-4799-8710-8
Mots-clésknowledge discovery, massive datasets, open source software, polyglot storage, Twitter analysis
Résumé

In this article we show how a multi-paradigm framework can fulfil the requirements of tweets analysis and reduce the waiting time for researchers that use computational resources and storage systems to support large-scale data analysis. The originality of our approach is to combine concerns about data harvesting, data storage, data analysis and data visualisation into a framework that supports inductive reasoning in multidisciplinary scientific research. Our main contribution is a polyglot storage system with a generic data model to support logical data independence and a set of tools that can provide a suitable solution for mixing different types of algorithms in order to maximise the extraction of knowledge. We describe the software architecture of our framework, the generic model and we show how it has been used in major projects and what characteristics have been validated.