Towards A Twitter Observatory: A Multi-Paradigm Framework For Collecting, Storing And Analysing Tweets
Affiliation auteurs | !!!! Error affiliation !!!! |
Titre | Towards A Twitter Observatory: A Multi-Paradigm Framework For Collecting, Storing And Analysing Tweets |
Type de publication | Conference Paper |
Year of Publication | 2016 |
Auteurs | Basaille I, Kirgizov S, Leclercq E, Savonnet M, Cullot N |
Editor | Espana S, Ralyte J, Souveyet C |
Conference Name | 2016 IEEE TENTH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS) |
Publisher | IEEE |
Conference Location | 345 E 47TH ST, NEW YORK, NY 10017 USA |
ISBN Number | 978-1-4799-8710-8 |
Mots-clés | knowledge discovery, massive datasets, open source software, polyglot storage, Twitter analysis |
Résumé | In this article we show how a multi-paradigm framework can fulfil the requirements of tweets analysis and reduce the waiting time for researchers that use computational resources and storage systems to support large-scale data analysis. The originality of our approach is to combine concerns about data harvesting, data storage, data analysis and data visualisation into a framework that supports inductive reasoning in multidisciplinary scientific research. Our main contribution is a polyglot storage system with a generic data model to support logical data independence and a set of tools that can provide a suitable solution for mixing different types of algorithms in order to maximise the extraction of knowledge. We describe the software architecture of our framework, the generic model and we show how it has been used in major projects and what characteristics have been validated. |