Damien Graux, PhD, HDR

~ Principal Research Scientist ~

Short Bio.

Since December 2022, I have been working as a principal research scientist for Huawei Technologies Ltd. in the United Kingdom. In particular, I work in the Knowledge Graph lab, where we conduct cutting-edge research on knowledge computing challenges.


From January 2021 to December 2022, I was a tenured researcher holding an Inria starting faculty position. In practice, I worked in the Wimmics group, based in Sophia Antipolis, near Nice (France). I mainly focused on exploring downstream use cases once Knowledge Graphs are built: for instance, by developing novel visualisations to help Semantic Web lay users access graphs, or by setting up analytics strategies on Knowledge Graphs through embeddings.


From October 2019 to December 2020, I was a Research Fellow at Trinity College Dublin (Ireland), working in the ADAPT Centre under the leadership of Prof. Declan O'Sullivan. In practice, I contributed to research efforts in Semantic Web technologies, mainly focusing on analyzing large distributed knowledge graphs and on designing complex transformation pipelines for heterogeneous Big Data. In particular, from July 2020, I was able to focus on these research topics thanks to a Marie Skłodowska-Curie ELITE-S fellowship.


From January 2018 to September 2019, I was a Senior Researcher at the Fraunhofer IAIS in Sankt Augustin (Germany, close to Bonn), focusing on the domain of Semantic Web and Linked Data in the context of large-scale datasets. My research topics included ontology management, ontology engineering, Semantic Web, Linked Data, clustering, and machine learning methods. I also applied the results of my research in various European and industry-funded projects. In parallel, I was an associated postdoctoral researcher in the Smart Data Analytics group at the University of Bonn, led by Prof. Jens Lehmann.


In 2017, as a postdoc, still with the Tyrex group (at Inria, France), I extended the work developed during my PhD thesis by integrating SPARQL evaluators into larger systems where various kinds of data structures are involved: several query results are needed (and aggregated) to build a complex answer. More specifically, I was trying to design efficient languages to facilitate the development of optimized ETL pipelines in a semantic context.


From 2013 to 2016, during my PhD thesis at Inria, with the Tyrex group in Grenoble, I focused on Semantic Web standards, especially on the Resource Description Framework (RDF) and its dedicated query language SPARQL. My main goal was to design efficient tools to evaluate SPARQL queries on very large RDF datasets (i.e. ≥100 GB). To that end, I first proposed a new set of criteria for ranking SPARQL evaluators, and then designed several efficient ones.

As a pastime alongside my main PhD activities, I also designed a semantic pipeline for trip planning that aggregates heterogeneous datasets (e.g. GTFS, RDF, CSV) in order to offer users tourist alternatives during plane stopovers.


Before 2013, I worked on designing and implementing broadcast algorithms with special properties such as UTO (uniform and totally ordered). This work, mainly developed in C, is also openly available on GitHub.

Appearance as of May 2018

Publications


Knowledge Graphs and Big Data Processing [Open Access]


Editors: Valentina Janev, Damien Graux, Hajira Jabeen, Emanuel Sallinger
Publisher: Springer
DOI: https://doi.org/10.1007/978-3-030-53199-7
2020

  1. Joint Proceedings of the QuWeDa and MEPDaW 2023: 7th Workshop on Storing, Querying and Benchmarking Knowledge Graphs and 9th Workshop on Managing the Evolution and Preservation of the Data Web [Preface; Full Volume]
    Muhammad Saleem, Axel-Cyrille Ngonga Ngomo, Damien Graux, Fabrizio Orlandi, Emetis Niazmand, Gabriela Ydler, Maria-Esther Vidal
    Co-located with the 22nd International Semantic Web Conference (ISWC), 2023
  2. Proceedings of the 8th Workshop on Managing the Evolution and Preservation of the Data Web (MEPDaW) [Preface; Full Volume]
    Damien Graux, Fabrizio Orlandi, Emetis Niazmand, Gabriela Ydler, Maria-Esther Vidal
    Co-located with the 21st International Semantic Web Conference (ISWC), 2022
  3. Proceedings of the 7th Workshop on Managing the Evolution and Preservation of the Data Web (MEPDaW) [Preface; Full Volume]
    Fabrizio Orlandi, Damien Graux, Julio Cesar dos Reis, Maria-Esther Vidal
    Co-located with the 20th International Semantic Web Conference (ISWC), 2021
  4. Proceedings of the 1st Ph.D. Workshop on Big Data Analytics [Preface; Full Volume]
    Damien Graux, Valentina Janev
    Co-located with the 3rd International Big Data Analytics Summer School (BDA), 2021
  5. Proceedings of the 6th Workshop on Managing the Evolution and Preservation of the Data Web (MEPDaW) [Preface; Full Volume]
    Fabrizio Orlandi, Damien Graux, Maria-Esther Vidal, Javier D. Fernández, Jeremy Debattista
    Co-located with the 19th International Semantic Web Conference (ISWC), 2020
  6. Joint Proceedings of the 1st International Workshop on Knowledge Graph Building and 1st International Workshop on Large Scale RDF Analytics [Preface; Full Volume]
    David Chaves-Fraga, Pieter Heyvaert, Freddy Priyatna, Juan Sequeda, Anastasia Dimou, Hajira Jabeen, Damien Graux, Gezim Sejdiu, Muhammad Saleem, Jens Lehmann
    Co-located with the 16th Extended Semantic Web Conference (ESWC), 2019

  1. Around Semantic Web data: distributed, heterogeneous and advanced processing
    Damien Graux
    Habilitation Thesis, 2024
  2. Reproduce, Replicate, Reevaluate. The Long but Safe Way to Extend Machine Learning Methods
    Luisa Werner, Nabil Layaïda, Pierre Genevès, Jérôme Euzenat, Damien Graux
    AAAI, 2024

  3. Large Language Models and Knowledge Graphs: Opportunities and Challenges [PDF]
    Jeff Z. Pan, Simon Razniewski, Jan-Christoph Kalo, Sneha Singhania, Jiaoyan Chen, Stefan Dietze, Hajira Jabeen, Janna Omeliyanenko, Wen Zhang, Matteo Lissandrini, Russa Biswas, Gerard de Melo, Angela Bonifati, Edlira Vakaj, Mauro Dragoni, Damien Graux
    Transactions on Graph Data and Knowledge (TGDK), Volume 1 Issue 1: 2:1-2:38, 2023
  4. Multi Platform-based Hate Speech Detection [PDF]
    Shane Cooke, Damien Graux, Soumyabrata Dev
    ICAART, 2023

  5. LOV-ES: Guiding the Ontology Selection to Structure Textual Data using Topic Modeling [PDF]
    Damien Graux, Anaïs Ollagnier
    ISWC (Posters and Demos), 2022
  6. Multi-Level Visual Tours of Weather Linked Data [PDF]
    Nadia Yacoubi, Damien Graux, Catherine Faron
    Visualization and Interaction for Ontologies and Linked Data (Voila!) collocated with the International Semantic Web Conference (ISWC), 2022
  7. Efficient semantic summary graphs for querying large knowledge graphs [PDF]
    Emetis Niazmand, Gezim Sejdiu, Damien Graux, Maria-Esther Vidal
    The International Journal of Information Management Data Insights, 2022
  8. Navigating the Earth with pure SPARQL [PDF]
    Damien Graux
    The 5th International Workshop on Geospatial Linked Data (GeoLD) collocated with the European Semantic Web Conference (ESWC), 2022
  9. Through the Lens of the Web Conference Series: A Look Into the History of the Web [PDF]
    Damien Graux, Fabrizio Orlandi
    The ACM Web Conference (ex WWW), 2022

  10. Hash-ssessing the freshness of SPARQL pipelines [PDF]
    Damien Graux, Fabrizio Orlandi, Declan O'Sullivan
    ISWC (Posters and Demos), 2021
  11. De-icing federated SPARQL pipelines: a method for assessing the “freshness” of result sets [PDF]
    Damien Graux, Fabrizio Orlandi, Declan O'Sullivan
    Managing the Evolution and Preservation of the Data Web (MEPDaW) collocated with the International Semantic Web Conference (ISWC), 2021
  12. Timelining Knowledge Graphs in the Browser [PDF]
    Damien Graux, Fabrizio Orlandi, Tanmay Kaushik, David Kavanagh, Hailing Jiang, Brian Bredican, Matthew Grouse, Dáithí Geary
    Visualization and Interaction for Ontologies and Linked Data (Voila!) collocated with the International Semantic Web Conference (ISWC), 2021
  13. Formal Concept Analysis for Semantic Compression of Knowledge Graph Versions [PDF]
    Damien Graux, Diego Collarana, Fabrizio Orlandi
    FCA4AI (Ninth Edition) co-located with IJCAI, 2021
  14. Hints to Save Time when Dealing with Big Data [PDF]
    Damien Graux
    Keynote for the PhD workshop of the third Big Data Summer School in Belgrade, 2021
  15. Beyond Classical SERVICE Clause in Federated SPARQL Queries: Leveraging the Full Potential of URI Parameters [PDF]
    Olivier Corby, Catherine Faron, Fabien Gandon, Damien Graux, Franck Michel
    WEBIST, 2021
  16. Embedding Knowledge Graphs Attentive to Positional and Centrality Qualities [PDF] [Appendix]
    Afshin Sadeghi, Diego Collarana, Damien Graux, Jens Lehmann
    ECML PKDD, 2021
  17. A fully decentralized triplestore managed via the Ethereum blockchain [PDF]
    Damien Graux, Sina Mahmoodi
    SEMANTiCS, 2021
  18. Involvement of OpenStreetMap in European H2020 Projects [PDF]
    Damien Graux, Thibaud Michel
    State of the Map (Academic Track), 2021
  19. Deploying a Strategy to Unlock Big Data Research and Teaching Activities in the West Balkan Region [PDF]
    Damien Graux, Valentina Janev, Hajira Jabeen, Emanuel Sallinger
    26th ACM Conference on Innovation and Technology in Computer Science Education V. 1 (ITiCSE), 2021
  20. A Big Data Learning Platform for the West Balkans and Beyond [PDF]
    Damien Graux, Valentina Janev, Hajira Jabeen, Emanuel Sallinger
    26th ACM Conference on Innovation and Technology in Computer Science Education V. 2 (ITiCSE), Tips, Techniques and Courseware, 2021
  21. Benchmarking RDF Metadata Representations: Reification, Singleton Property and RDF* [PDF]
    Fabrizio Orlandi, Damien Graux, Declan O'Sullivan
    IEEE 15th International Conference on Semantic Computing (ICSC), 2021

  22. A real-time visual dashboard for Wikidata edits [PDF]
    Damien Graux, Fabrizio Orlandi, Brian Lynch, Isobel Mahon, Odhran Mullen, Alex Mahon, Flora Molnar, Lexes Mantiquilla
    Visualization and Interaction for Ontologies and Linked Data (Voila!) collocated with the International Semantic Web Conference (ISWC), 2020
  23. Semantic Schema Mapping for Interoperable Data-Exchange [PDF]
    Harshvardhan J. Pandit, Damien Graux, Fabrizio Orlandi, Ademar Crotti Junior, Declan O'Sullivan, Dave Lewis
    International Workshop on Ontology Matching (OM) collocated with the International Semantic Web Conference (ISWC), 2020
  24. Verbalizing the Evolution of Knowledge Graphs with Formal Concept Analysis [PDF]
    Martín Arispe Riveros, Mayesha Tasnim, Damien Graux, Fabrizio Orlandi, Diego Collarana
    Natural Language Interfaces for the Web of Data Workshop (NLIWOD) collocated with the International Semantic Web Conference (ISWC), 2020
  25. LAMBDA learning and consulting platform [PDF]
    Valentina Janev, Dejan Paunović, Emanuel Sallinger, Damien Graux
    11th International Conference on eLearning (eLearning), 2020
  26. Federated Query Processing
    Kemele M. Endris, Maria-Esther Vidal, Damien Graux
    Chapter 5 in Knowledge Graphs and Big Data Processing (pages 73-86), 2020
  27. Scalable Knowledge Graph Processing Using SANSA
    Hajira Jabeen, Damien Graux, Gezim Sejdiu
    Chapter 7 in Knowledge Graphs and Big Data Processing (pages 105-121), 2020
  28. Context-Based Entity Matching for Big Data
    Mayesha Tasnim, Diego Collarana, Damien Graux, Maria-Esther Vidal
    Chapter 8 in Knowledge Graphs and Big Data Processing (pages 122-146), 2020
  29. Meta-Hyperband: Hyperparameter optimization with meta-learning and Coarse-to-Fine [PDF]
    Samin Payrosangari, Afshin Sadeghi, Damien Graux, Jens Lehmann
    IDEAL, 2020
  30. MINDS: a translator to embed mathematical expressions inside SPARQL queries [PDF]
    Damien Graux, Gezim Sejdiu, Claus Stadler, Giulio Napolitano, Jens Lehmann
    SEMANTiCS, 2020
  31. How many stars do you see in this constellation? [PDF]
    Fabrizio Orlandi, Damien Graux, Declan O'Sullivan
    ESWC (Poster Track), 2020
  32. Semantic Data Integration for the SMT Manufacturing Process using SANSA Stack [PDF]
    Mohamed Nadjib Mami, Irlán Grangel-González, Damien Graux, Enkeleda Elezi, Felix Lösch
    ESWC (Industry Track), 2020
  33. Knowledge Graph-based Legal Search over German Court Cases [PDF]
    Ademar Crotti Junior, Fabrizio Orlandi, Damien Graux, Murhaf Hossari, Declan O'Sullivan, Christian Hartz, Christian Dirschl
    ESWC (Industry Track), 2020
  34. Establishing a Strong Baseline for Privacy Policy Classification [PDF]
    Najmeh Mousavi Nejad, Pablo Jabat, Rostislav Nedelchev, Simon Scerri, Damien Graux
    IFIP-SEC, 2020
  35. MDE: Multiple Distance Embeddings for Link Prediction in Knowledge Graphs [PDF][arXiv]
    Afshin Sadeghi, Damien Graux, Hamed Shariat Yazdi, Jens Lehmann
    ECAI, 2020

  36. The Query Translation Landscape: a Survey [PDF]
    Mohamed Nadjib Mami, Damien Graux, Harsh Thakkar, Simon Scerri, Sören Auer, Jens Lehmann
    Pre-print version, 2019
  37. Uniform Access to Multiform Data Lakes using Semantic Technologies [PDF]
    Mohamed Nadjib Mami, Damien Graux, Simon Scerri, Hajira Jabeen, Sören Auer, Jens Lehmann
    iiWAS, 2019
  38. SemanGit: A Linked Dataset from git [PDF]
    Dennis Oliver Kubitza, Matthias Böckmann, Damien Graux
    ISWC, 2019
  39. Sparklify: A Scalable Software Component for Efficient evaluation of SPARQL queries over distributed RDF datasets [PDF]
    Claus Stadler, Gezim Sejdiu, Damien Graux, Jens Lehmann
    ISWC, 2019
  40. Squerall: Virtual Ontology-Based Access to Heterogeneous and Large Data Sources [PDF]
    Mohamed Nadjib Mami, Damien Graux, Simon Scerri, Hajira Jabeen, Sören Auer, Jens Lehmann
    ISWC, 2019
  41. Towards Semantically Structuring GitHub [PDF]
    Dennis Oliver Kubitza, Matthias Böckmann, Damien Graux
    ISWC (Posters and Demos), 2019
  42. Querying large-scale RDF datasets using the SANSA framework [PDF]
    Claus Stadler, Gezim Sejdiu, Damien Graux, Jens Lehmann
    ISWC (Posters and Demos), 2019
  43. How to feed the Squerall with RDF and other data nuts? [PDF]
    Mohamed Nadjib Mami, Damien Graux, Simon Scerri, Hajira Jabeen, Sören Auer, Jens Lehmann
    ISWC (Posters and Demos), 2019
  44. Interroger des Lacs de Données en utilisant Spark & Presto [PDF]
    Mohamed Nadjib Mami, Damien Graux, Simon Scerri, Hajira Jabeen, Sören Auer
    BDA (Demo Track), 2019
  45. Big Data Analytics: Lectures from the LAMBDA network [PDF]
    Valentina Janev, Dejan Paunović, Damien Graux, Hajira Jabeen, Emanuel Sallinger, Sahar Vahdati
    10th International Conference on eLearning (eLearning), 2019
  46. Towards A Scalable Semantic-based Distributed Approach for SPARQL query evaluation [PDF]
    Gezim Sejdiu, Damien Graux, Imran Khan, Ioanna Lytra, Hajira Jabeen, Jens Lehmann
    SEMANTiCS, 2019
  47. The Hubs and Authorities Transaction Network Analysis using the SANSA framework [PDF]
    Danning Sui, Gezim Sejdiu, Damien Graux, Jens Lehmann
    SEMANTiCS (Poster Track), 2019
  48. COMET: A Contextualized Molecule-Based Matching Technique [PDF]
    Mayesha Tasnim, Diego Collarana, Damien Graux, Mikhail Galkin, Maria-Esther Vidal
    DEXA, 2019
  49. Towards Measuring Risk Factors in Privacy Policies [PDF]
    Najmeh Mousavi Nejad, Damien Graux, Diego Collarana
    Workshop on Artificial Intelligence and the Administrative State collocated with ICAIL'19 (Position Paper)
  50. Clustering Pipelines of large RDF POI Data [PDF]
    Rajjat Dadwal, Damien Graux, Gezim Sejdiu, Hajira Jabeen, Jens Lehmann
    ESWC 2019 (Poster Track)
  51. Summarizing Entity Temporal Evolution in Knowledge Graphs [PDF]
    Mayesha Tasnim, Diego Collarana, Damien Graux, Fabrizio Orlandi, Maria-Esther Vidal
    MEPDaW, WWW (Companion Volume) 2019: 961-965
  52. Querying Data Lakes using Spark and Presto [PDF]
    Mohamed Nadjib Mami, Damien Graux, Simon Scerri, Hajira Jabeen, Sören Auer
    WWW 2019: 3574-3578
  53. Big POI Data Integration with Linked Data Technologies [PDF]
    Spiros Athanasiou, Giorgos Giannopoulos, Damien Graux, Nikos Karagiannakis, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Kostas Patroumpas, Mohamed Ahmed Sherif, Dimitrios Skoutas
    22nd International Conference on Extending Database Technology, Lisbon, Portugal. pp. 477–488 (EDBT 2019)

  54. A Multi-Criteria Experimental Ranking of Distributed SPARQL Evaluators [PDF]
    Damien Graux, Louis Jachiet, Pierre Genevès, Nabil Layaïda
    2018 IEEE International Conference on Big Data (Big Data). IEEE, 2018. p. 693-702
  55. Profiting from Kitties on Ethereum: Leveraging Blockchain RDF Data with SANSA [PDF]
    Damien Graux, Gezim Sejdiu, Hajira Jabeen, Jens Lehmann, Danning Sui, Dominik Muhs, Johannes Pfeffer
    Proceedings of the Posters and Demos Track of the 14th International Conference on Semantic Systems co-located with the 14th International Conference on Semantic Systems (SEMANTiCS 2018), Vienna, Austria, September 10-13, 2018.
  56. MINDS: a translator to embed mathematical expressions inside SPARQL queries [PDF]
    Damien Graux, Gezim Sejdiu, Claus Stadler, Giulio Napolitano, Jens Lehmann
    Technical Report, 2018.

  57. Une classification expérimentale multi-critère des évaluateurs SPARQL répartis [PDF]
    Damien Graux, Louis Jachiet, Pierre Genevès, Nabil Layaïda
    BDA 2017 - 33ème Conférence sur la Gestion de Données - Principes, Technologies et Applications, Nov 2017, Nancy, France
  58. SPARUB: SPARQL UPDATE Benchmark [PDF]
    Damien Graux, Pierre Genevès, Nabil Layaïda
    Technical report, 2017
  59. HAP: Building Pipelines with Heterogeneous Data and Hive [PDF]
    Damien Graux, Pierre Genevès, Nabil Layaïda
    Technical report, 2017

  60. On the Efficient Distributed Evaluation of SPARQL Queries [PDF]
    Damien Graux
    PhD Thesis, 2016
  61. SPARQLGX : Une Solution Distribuée pour RDF Traduisant SPARQL vers Spark [PDF]
    Damien Graux, Louis Jachiet, Pierre Genevès, Nabil Layaïda
    BDA 2016 - 32ème Conférence sur la Gestion de Données - Principes, Technologies et Applications, Nov 2016, Poitiers, France
  62. SPARQLGX in Action: Efficient Distributed Evaluation of SPARQL with Apache Spark [PDF]
    Damien Graux, Louis Jachiet, Pierre Genevès, Nabil Layaïda
    15th International Semantic Web Conference (ISWC 2016 demo paper), Oct 2016, Kobe, Japan
  63. Smart Trip Alternatives for the Curious [PDF]
    Damien Graux, Pierre Genevès, Nabil Layaïda
    15th International Semantic Web Conference (ISWC 2016 demo paper), Oct 2016, Kobe, Japan
  64. SPARQLGX: Efficient Distributed Evaluation of SPARQL with Apache Spark [PDF]
    Damien Graux, Louis Jachiet, Pierre Genevès, Nabil Layaïda
    The 15th International Semantic Web Conference, Oct 2016, Kobe, Japan. DOI: https://doi.org/10.1007/978-3-319-46547-0_9

  65. TRAINS : a Throughput-Efficient Uniform Total Order Broadcast Algorithm [PDF]
    Michel Simatic, Arthur Foltz, Damien Graux, Nicolas Hascoet, Stéphanie Ouillon, Nathan Reboud, Tiezhen Wang
    NTDS - ICPE 2015: International Conference on Protocol Engineering (ICPE) and International Conference on New Technologies of Distributed Systems (NTDS), Jul 2015, Paris, France. IEEE, pp. 1-8, 2015. DOI: https://doi.org/10.1109/NOTERE.2015.7293477

Funded Research Projects

Project
[Role]
Abstract
Date

LAMBDA
[Lecturer]
LAMBDA defines a scientific strategy for stepping up and stimulating scientific excellence and innovation capacity, increasing research capacities and unlocking the research potential of the biggest and the oldest R&D Institute in the ICT area in the whole West Balkan region, turning the Institute Mihajlo Pupin into a regional point of reference when it comes to multidisciplinary ICT competence related to Big Data analytics.
Since 2019

SemanGit
[Leader]
SemanGit provides a resource at the crossroads of both Semantic Web and git web-based version control systems. It is actually the first collection of linked data extracted from GitHub based on a git ontology we designed and extended to include specific GitHub features.
Since 2018

QualiChain
[Tasks Leader]
QualiChain targets the creation, piloting and evaluation of a decentralised platform for storing, sharing and verifying education and employment qualifications and focuses on the assessment of the potential of blockchain technology, algorithmic techniques and computational intelligence for disrupting the domain of public education, as well as its interfaces with private education, the labour market, public sector administrative procedures and the wider socio-economic developments.
2019

Better
[Task Leader]
BETTER is implementing a Big Data intermediate service layer focused on creating user-centric services and tools, while addressing the full data lifecycle associated with EO data, to bring more downstream users to the EO market and maximise exploitation of Copernicus data and information services.
2018-2019

SLIPO
[Work Package Leader]
SLIPO develops software, models and processes for: transforming conventional POI formats and schemas into RDF data; interlinking POI entities from different datasets; enriching POI entities with additional metadata, including temporal, thematic and semantic properties; fusing Linked POI data in order to produce more complete and accurate POI profiles; assessing the quality of the integrated POI data; offering value added services based on spatial aggregation, association extraction and spatiotemporal prediction.
2018-2019

Clear
[Contributor]
Clear addresses one fundamental challenge of our time: the construction of effective programming models and compilation techniques for the correct, efficient and scalable exploitation of large amounts of data.
2017

Datalyse
[Contributor]
Datalyse is a smart treatment demonstrator dedicated to Big Data focusing on collecting, certificating, integrating, categorizing, securing, enriching and sharing data.
2013-2016

Software Projects

Mentoring Activities

PhD Co-supervision


BSc & MSc Theses Supervision


Software Engineer Supervision

Research Community Services