Scholarly record
THE METHOD FOR SEMANTICALLY ORIENTED BIG DATA MODELLING IN ACCORDANCE WITH CONTEXT
Abstract
The existing approach to organizing big data processing requires the analyst to understand the set of initial data. This understanding presupposes the presence of knowledge not only about the list of available characteristics, their fullness, noisiness, etc., but also about the nature of their occurrence - the semantics of the data. Big data semantics can be represented using context. The context in the project refers to the activity in which the data is used or because of which was obtained. Applied scientific approaches that are relevant today, allowing to build models considering semantics and (or) context, are based on the concept of generalization - the same in meaning, but differently named data values are combined under some generalizing descriptive construction. This primarily leads to a simplified presentation of the content of the big data store, since semantics and context are used to identify homogeneous values. These models provide the analyst with a scaled-down view of the big data warehouse in terms of existing models, while the semantics and context are still hidden. The paper proposes an analytics-oriented technology for modeling big data as possible data acquisition actions. The main feature of this technology is that the initiator of filling or creating a big data warehouse is directly the analyst as the main consumer of information. As a result of such structuring of the process in the context of analysis, the need to present the data available in the warehouse will no longer make sense, since this model will be initially determined by the analyst. And the data itself, presented in the storage, will correspond to the purposes of the analysis, and thus storage will be optimized.
Publication Impact Profile
Publication details
References7
Gil, D., & Song, I.-Y. (2016). Modeling and Management of Big Data: Challenges and opportunities. Future Generation Computer Systems, 63, 96-99,
Romero O., Simitsis A., Abell? A. GEM: Requirement-driven generation of ETL and multidimensional conceptual designs // Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2011. Vol. 6862 LNCS., Pages 80-95
Bansal S.K. Towards a Semantic Extract-Transform-Load (ETL) framework for big data integration // Proceedings - 2014 IEEE International Congress on Big Data, BigData Congress 2014. 2014, Pages 522-529
Bansal S.K., Kagemann S. Integrating Big Data: A Semantic Extract-Transform-Load Framework // Computer. 2015. Vol. 48, № 3. , Pages 42-50
El Akkaoui Z. et al. BPMN-based conceptual modeling of ETL processes // Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2012. Vol. 7448 LNCS. Pages 1-14
Chen, M., Mao, S., & Liu, Y. (2014). Big Data: A Survey. Mobile Networks and Applications, 19(2), 171-209
Zicari, R. V. (2014). Big Data: Challenges and Opportunities. Big Data Computing, 103-128
View or Download full articleAccess options
SWS access login
Login as SWS Scientific CommitteeLogin as SWS Scientific PartnerLogin as SWS AuthorAuthors and approved SWS contributors will read and export their own linked papers after identity matching by SWS profile, email and SGEM GlobalID.
For librarian assistance: [email protected]
Purchase Instant Access
- Article can be downloaded after successful payment.
- Article may be used according to SWS library access terms.
- Article cannot be redistributed.

