Transactions on Large-Scale Data- and Knowledge-Centered Systems L

The LNCS journal Transactions on Large-Scale Data- and Knowledge-Centered Systems focuses on data management, knowledge discovery, and knowledge processing, which are core and hot topics in computer science. Since the 1990s, the Internet has become the main driving force behind application development in all domains. Growing demand for resource sharing (e.g., computing resources, services, metadata, data sources) across sites connected through networks has driven data- and knowledge-management systems to evolve from centralized to decentralized systems, enabling large-scale, highly scalable distributed applications.

This, the 50th issue of Transactions on Large-Scale Data- and Knowledge-Centered Systems, contains five fully revised selected regular papers. Topics covered include data anonymization, quasi-identifier discovery methods, symbolic time series representation, detection of anomalies in time series, data quality management in biobanks, and the use of multi-agent technology in the design of intelligent systems for maritime transport.



Author(s): Abdelkader Hameurlain (editor), A Min Tjoa (editor)
Publisher: Springer
Year: 2021

Language: English
Pages: 128

Preface
Organization
Contents
A Parallel Quasi-identifier Discovery Scheme for Dependable Data Anonymisation
1 Introduction
2 Discovering Quasi-identifiers
2.1 Quasi-identifiers
2.2 Enumeration and NP-Completeness
2.3 Characterising QIDs
2.4 Computing QIDs Sequentially
2.5 The Find-QID Scheme
2.6 Discovering QIDs
2.7 Complexity Analysis - Discussion
3 Attack Model
4 Empirical Model
4.1 QID Discovery and Processing
4.2 Implementing Find-QID
4.3 Parallelism
4.4 Performance Comparison on GPU vs. CPU Architectures
4.5 Attack Model Evaluation
5 Related Work
6 Conclusions and Future Work
References
Towards Symbolic Time Series Representation Improved by Kernel Density Estimators
1 Introduction
2 Related Work
2.1 Symbolic Representation - SAX
2.2 Techniques for Distribution Estimation
3 Distribution-Wise SAX (edwSAX)
3.1 Probability Density Estimation
3.2 Breakpoints and Centroids Vector Calculation
3.3 Distance Measure
4 Experiments and Evaluation
4.1 Evaluation Data Sets Description
4.2 Tightness of Lower Bound
4.3 Reconstruction Error
5 Conclusion and Future Work
References
Anomaly Detection in Time Series
1 Introduction
2 Definitions
2.1 Time-Series Patterns
2.2 Anomalies
3 Anomaly Detection Approaches for Time Series
3.1 Statistical Based Approaches
3.2 Clustering-Based Approaches
3.3 Matrix Profile Technique
4 Conclusion
References
Designing Intelligent Marine Framework Based on Complex Adaptive System Principles
1 Introduction
2 Related Work
3 Framework Structure
4 Emergence in Complex System
4.1 Containers Tagging and Aggregation
5 Risks in Intelligent Environment
6 Discussion and Conclusion
References
Data Item Quality for Biobanks
1 Introduction
2 Background
2.1 Data Quality
2.2 Data Quality Management Systems
2.3 Data in Biobanks
2.4 Properties of Biobanks Important for Data Quality Management
3 State of the Art
3.1 Data Quality in Medical Information Systems
4 Data Quality Definition for Biobanks
4.1 Biobanks as Data Brokers
4.2 Introducing Quality Characteristics for Biobank Data
5 Data Item Characteristics
5.1 Data Item Completeness
5.2 Data Accuracy and Validity
5.3 Data Reliability
5.4 Data Consistency
5.5 Data Timeliness
5.6 Data Precision
5.7 Data Provenance
5.8 Example of Dealing with Quality Characteristics
6 Conclusions
References
Author Index