Digital Humanities Laboratory (DHLab)

The DHLab: A Data Science Center supporting Digital Humanities

To maintain the emphasis on transdisciplinarity, which is the main focus of all actions carried out through the IDEX UCAJEDI project, the Center of Modeling, Simulation, and Interactions (MSI) has created its second thematic expertise center after the MDLab: The Digital Humanities Lab (DHLab).  

The DHLab aims to strengthen ties between data scientists and researchers in social sciences and the humanities at Université Côte d’Azur.

Objectives

The DHLab is currently running two projects ("Automated Analysis of Archaeological Wood Images" and “Regional expertise alignment and firm performance”), both addressing text mining issues. The DHLab pilot phase, which was part of the Data Science core program of the UCAJEDI project, ended in 2021 with the final award of the IDEX. The DHLab now supports research and development within Université Côte d'Azur on other modeling and data processing issues related to social sciences and humanities. It provides valuable assistance in modeling, classifying and visualizing heterogeneous data.

Details on how the DHLab can help with new research projects will be posted soon.

Training

The DHLab is currently contributing to the deployment of a cycle of short training courses, organized as seminars and workshops on data science, artificial intelligence, and data integration applied to several domains, held in Nice or Sophia Antipolis.

Find out more about the MSI's short training courses and subscribe to the MSI’s newsletter!

Team

DHLab strategic committee chaired by Arnaud Zucker, Prof. of Greek language and literature, and consisting of Université Côte d’Azur representatives, is in charge of the decisions relating to DHLab’s activities, in consultation with Université Côte d'Azur's governance.
The following DHLab staff is working under the hierarchical authority of the Director of the MSI, Prof. Stéphane Descombes:

Areas of Research

The DHLab's areas of research are as follows:

  • Pre-processing algorithms and unstructured data cleansing
  • Classification, processing and modeling of heterogeneous data such as texts and interaction data (graphs)
  • Deep learning, convolutional or recurrent networks.

Projects can benefit from the databases of the following entities:

  1. Université Côte d'Azur laboratories;
  2. INSEE;
  3. European Patent Office;
  4. Microsoft Academic.

In addition, a database of anthracological images, on a national basis, is being compiled by CEPAM and will be used in the "Automated Analysis of Archaeological Wood Images" project.

Publications

Preprints

L. Vanni, M. Corneli, D. Mayaffre, F. Precioso : From text saliency to linguistic objects: learning linguistic interpretable markers with a multichannel convolutional architecture (2021). hal-03142170

Journal Papers

M. Corneli, C. Bouveyron, P. Latouche : Co-Clustering of Ordinal Data via Latent Continuous Random Variables and Not Missing at Random Entries, “Journal of Computational and Graphical Statistics”, Mars 2020.

Book Chapters

L. Vanni, M. Corneli, D. Longrée, D. Mayaffre, F. Precioso. Key Passages : From statistics to Deep Learning. D. Fioredistella Iezzi; D. Mayaffre; Michelangelo Misuraca. Text Analytics. Advances and Challenges, Springer, pp.41-54, 2020, 978-3-030-52679-5. 10.1007/978-3-030-52680-1_4. hal-03099658⟩.

Conferences

L. Vanni, M. Corneli, D. Longrée, D. Mayaffre, F. Precioso. Hyperdeep : deep learning descriptif pour l'analyse de données textuelles. JADT 2020 - 15èmes Journées Internationales d'Analyse statistique des Données Textuelles, Juin 2020, Toulouse, France. hal-02926880

Contacts

Dr. Marco Corneli:  Marco.CORNELI[at]univ-cotedazur.fr 

Ongoing Research Projects

Charcoal image. Original on the left. Overlaid heatmap on the right, highlighting the regions used by the CNN for classification
Charcoal image. Original on the left. Overlaid heatmap on the right, highlighting the regions used by the CNN for classification

Former Research Projects

Figure 1: Key-patterns in Macron’s speech attributed to the classes “Trump” and “Obama”.
Figure 1: Key-patterns in Macron’s speech attributed to the classes “Trump” and “Obama”. Key-patterns in Macron’s speech attributed to the classes “Trump” and “Obama”.