Advanced Statistical Methods for the Analysis of Large Data-Sets

The theme of the meeting was “Statistical Methods for the Analysis of Large Data-Sets”. In recent years there has been increasing interest in this subject; in fact a huge quantity of information is often available but standard statistical techniques are usually not well suited to managing this kind of data. The conference serves as an important meeting point for European researchers working on this topic and a number of European statistical societies participated in the organization of the event.

The book includes 45 papers selected from the 156 papers accepted for presentation and discussed at the conference on “Advanced Statistical Methods for the Analysis of Large Data-sets.”

Author(s): Laura Bocci, Isabella Mingo (auth.), Agostino Di Ciaccio, Mauro Coli, Jose Miguel Angulo Ibanez (eds.)
Series: Studies in Theoretical and Applied Statistics / Selected Papers of the Statistical Societies
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2012

Language: English
Pages: 486
Tags: Statistical Theory and Methods; Statistics and Computing/Statistics Programs; Statistics for Business/Economics/Mathematical Finance/Insurance

Front Matter....Pages i-xiii
Front Matter....Pages 1-1
Clustering Large Data Set: An Applied Comparative Study....Pages 3-12
Clustering in Feature Space for Interesting Pattern Identification of Categorical Data....Pages 13-22
Clustering Geostatistical Functional Data....Pages 23-31
Joint Clustering and Alignment of Functional Data: An Application to Vascular Geometries....Pages 33-43
Front Matter....Pages 45-45
Bayesian Methods for Time Course Microarray Analysis: From Genes’ Detection to Clustering....Pages 47-56
Longitudinal Analysis of Gene Expression Profiles Using Functional Mixed-Effects Models....Pages 57-67
A Permutation Solution to Compare Two Hepatocellular Carcinoma Markers....Pages 69-78
Front Matter....Pages 79-79
Statistical Perspective on Blocking Methods When Linking Large Data-sets....Pages 81-89
Integrating Households Income Microdata in the Estimate of the Italian GDP....Pages 91-99
The Employment Consequences of Globalization: Linking Data on Employers and Employees in the Netherlands....Pages 101-111
Applications of Bayesian Networks in Official Statistics....Pages 113-123
Front Matter....Pages 125-125
A Correlated Random Effects Model for Longitudinal Data with Non-ignorable Drop-Out: An Application to University Student Performance....Pages 127-136
Risk Analysis Approaches to Rank Outliers in Trade Data....Pages 137-144
Problems and Challenges in the Analysis of Complex Data: Static and Dynamic Approaches....Pages 145-157
Ensemble Support Vector Regression: A New Non-parametric Approach for Multiple Imputation....Pages 159-168
Front Matter....Pages 169-169
On the Use of PLS Regression for Forecasting Large Sets of Cointegrated Time Series....Pages 171-179
Large-Scale Portfolio Optimisation with Heuristics....Pages 181-192
Detecting Short-Term Cycles in Complex Time Series Databases....Pages 193-204
Assessing the Beneficial Effects of Economic Growth: The Harmonic Growth Index....Pages 205-215
Time Series Convergence within I(2) Models: the Case of Weekly Long Term Bond Yields in the Four Largest Euro Area Countries....Pages 217-226
Front Matter....Pages 227-227
Anthropogenic CO2 Emissions and Global Warming: Evidence from Granger Causality Analysis....Pages 229-239
Temporal and Spatial Statistical Methods to Remove External Effects on Groundwater Levels....Pages 241-251
Reduced Rank Covariances for the Analysis of Environmental Data....Pages 253-263
Radon Level in Dwellings and Uranium Content in Soil in the Abruzzo Region: A Preliminary Investigation by Geographically Weighted Regression....Pages 265-275
Front Matter....Pages 277-277
Applications of Large Deviations to Hidden Markov Chains Estimation....Pages 279-285
Multivariate Tail Dependence Coefficients for Archimedean Copulae....Pages 287-296
A Note on Density Estimation for Circular Data....Pages 297-304
Markov Bases for Sudoku Grids....Pages 305-315
Front Matter....Pages 317-317
Estimating the Probability of Moonlighting in Italian Building Industry....Pages 319-328
Use of Interactive Plots and Tables for Robust Analysis of International Trade Data....Pages 329-338
Generational Determinants on the Employment Choice in Italy....Pages 339-349
Route-Based Performance Evaluation Using Data Envelopment Analysis Combined with Principal Component Analysis....Pages 351-360
Front Matter....Pages 361-361
Web Surveys: Methodological Problems and Research Perspectives....Pages 363-373
Semantic Based DCM Models for Text Classification....Pages 375-384
Probabilistic Relational Models for Operational Risk: A New Application Area and an Implementation Using Domain Ontologies....Pages 385-395
Front Matter....Pages 397-397
Efficient Statistical Sample Designs in a GIS for Monitoring the Landscape Changes....Pages 399-407
Studying Foreigners’ Migration Flows Through a Network Analysis Approach....Pages 409-417
Estimation of Income Quantiles at the Small Area Level in Tuscany....Pages 419-428
The Effects of Socioeconomic Background and Test-taking Motivation on Italian Students’ Achievement....Pages 429-440
Front Matter....Pages 441-441
Firm Size Dynamics in an Industrial District: The Mover-Stayer Model in Action....Pages 443-452
Multiple Correspondence Analysis for the Quantification and Visualization of Large Categorical Data Sets....Pages 453-463
Multivariate Ranks-Based Concordance Indexes....Pages 465-473
Methods for Reconciling the Micro and the Macro in Family Demography Research: A Systematisation....Pages 475-484