Data analysis on big data applications with small samples and incomplete information

Soizic Linford, Benjamin Bogdanovic, Kuo Ming Chao, Sladana Jankovic, Vladislav Maras, Mirjana Bugarinovic, Ilias Trochidis

Research output: Chapter in Book/Report/Conference proceedingConference proceeding

Abstract

The EU and other public organizations at different levels of national and local government across the world have funded and invested in numerous research and development projects on big data transport applications over last few years. The mid and long term effectiveness of these applications is very difficult to measure, and the benefits and usability of these applications are not easy to calculate. NOESIS, funded under EU H2020 program, aims to design a decision supported tool by gathering and analyzing these applications as use cases to formulate sufficient knowledge for policy makers to make informed decisions for their big data transport applications. The challenges in this work are associated with a small number of samples, with incomplete information, but having a good size of features that need to be analyzed to make a confident enough recommendation. This paper reports various statistical and machine learning approaches used to address these challenges and their results.

Original languageEnglish
Title of host publicationProceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019
EditorsWeiming Shen, Hugo Paredes, Junzhou Luo, Jean-Paul Barthes
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages146-151
Number of pages6
ISBN (Electronic)9781728103501, 9781728103495
DOIs
Publication statusPublished - 8 Aug 2019
Event23rd IEEE International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019 - Porto, Portugal
Duration: 6 May 20198 May 2019

Publication series

NameProceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019

Conference

Conference23rd IEEE International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019
CountryPortugal
CityPorto
Period6/05/198/05/19

    Fingerprint

Keywords

  • Big data
  • Machine learning
  • Multivariate regression
  • Random forest

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Human-Computer Interaction
  • Information Systems
  • Information Systems and Management

Cite this

Linford, S., Bogdanovic, B., Chao, K. M., Jankovic, S., Maras, V., Bugarinovic, M., & Trochidis, I. (2019). Data analysis on big data applications with small samples and incomplete information. In W. Shen, H. Paredes, J. Luo, & J-P. Barthes (Eds.), Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019 (pp. 146-151). [8791927] (Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CSCWD.2019.8791927