SEMANTIC HMC: A PREDICTIVE MODEL USING MULTI-LABEL CLASSIFICATION FOR BIG DATA
Title | SEMANTIC HMC: A PREDICTIVE MODEL USING MULTI-LABEL CLASSIFICATION FOR BIG DATA |
Publication Type | Conference Paper |
Year of Publication | 2015 |
Authors | Peixoto R, Hassan T, Cruz C, Bertaux A, Silva N |
Conference Name | 2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 2 |
Publisher | IEEE; IEEE COMP SOC; IEEE Tech Comm Scalable Comp; Aalto Univ, Sch Elect Engn; Integrated Serv Networks, State Key Lab; NOKIA; SSH; ERICSSON; Tekes; Federat Finnish Learned Soc; Xidian Univ |
Conference Location | 345 E 47TH ST, NEW YORK, NY 10017 USA |
ISBN Number | 978-1-4673-7952-6 |
Keywords | big data, Classification, Machine learning, multi-classify, ontology, semantic technologies |
Abstract | One of the biggest challenges in Big Data is extracting value from large volumes of data. To exploit this value, one must focus on extracting knowledge from Big Data sources. In this paper we present a new, simple but highly scalable process to automatically learn the label hierarchy from huge sets of unstructured text. We aim to extract knowledge from these sources using a Hierarchical Multi-Label Classification process called Semantic HMC. Semantic HMC comprises five steps: Indexation, Vectorization, Hierarchization, Resolution and Realization. The first three steps construct the label hierarchy from the data sources. The last two steps classify new items according to the labels of that hierarchy. To perform the classification without relying heavily on the user, the process is unsupervised: no thesaurus or labelled examples are required. The process is implemented on a scalable, distributed platform to process Big Data. |
DOI | 10.1109/Trustcom.2015.578 |
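
The five steps named in the abstract can be illustrated with a toy, single-machine sketch. The abstract does not say how each step is computed, so everything below is an assumption made for illustration: the function names are hypothetical, the hierarchy is built with a simple co-occurrence subsumption heuristic (a term `a` becomes a parent of `b` when `a` appears in most documents that contain `b`, but not the reverse), and the `min_df` and `threshold` parameters are invented. It is not the paper's implementation and does not reflect its distributed platform.

```python
import re
from collections import Counter, defaultdict


def index(docs):
    """Indexation (assumed): reduce each document to its set of lowercase terms."""
    return [set(re.findall(r"[a-z]+", d.lower())) for d in docs]


def vectorize(indexed):
    """Vectorization (assumed): document frequency and pairwise co-occurrence counts."""
    df = Counter()
    co = defaultdict(Counter)
    for terms in indexed:
        for t in terms:
            df[t] += 1
            for u in terms:
                if u != t:
                    co[t][u] += 1
    return df, co


def hierarchize(df, co, min_df=2, threshold=0.8):
    """Hierarchization (assumed heuristic): a is a parent of b when
    P(a | b) >= threshold and P(b | a) < threshold."""
    labels = [t for t, n in df.items() if n >= min_df]
    parents = defaultdict(set)
    for b in labels:
        for a in labels:
            if a == b:
                continue
            p_a_given_b = co[b][a] / df[b]
            p_b_given_a = co[a][b] / df[a]
            if p_a_given_b >= threshold and p_b_given_a < threshold:
                parents[b].add(a)
    return labels, parents


def resolve(labels, item_terms):
    """Resolution (assumed): candidate labels are the labels whose term occurs in the item."""
    return {label for label in labels if label in item_terms}


def realize(candidates, parents):
    """Realization (assumed): extend the candidates with all of their ancestors."""
    result = set()
    stack = list(candidates)
    while stack:
        label = stack.pop()
        if label not in result:
            result.add(label)
            stack.extend(parents.get(label, ()))
    return result


# Toy corpus: no thesaurus or labelled examples are supplied.
docs = [
    "sport football", "sport football",
    "sport tennis", "sport tennis",
    "music jazz", "music jazz",
    "music rock", "music rock",
]
df, co = vectorize(index(docs))
labels, parents = hierarchize(df, co)

# Classify a new item against the learned hierarchy.
new_item = index(["a football match"])[0]
print(sorted(realize(resolve(labels, new_item), parents)))  # ['football', 'sport']
```

In this sketch the first three functions build the hierarchy from the corpus alone, and the last two assign a new item both its matching label and that label's ancestors, which mirrors the split between hierarchy construction and classification described in the abstract.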