Skip to content
- http://contentcheck.liris.cnrs.fr/#/home (work in progress): source code and relevant material of paper Flash points: Discovering exceptional pairwise behaviors in vote or rating data, Adnene Belfodil, Sylvie Cazalens, Philippe Lamarre, Marc Plantevit. ECML/PKDD conference, 2017.
This work aims to discover contexts that lead well-distinguished collections of individuals to change their pairwise agreement with respect to their usual one.
- https://gitlab.inria.fr/cedar/excel-extractor and https://gitlab.inria.fr/cedar/insee-crawler: source code of the paper Extracting Linked Data from statistic spreadsheets Tien Duc Cao, Ioana Manolescu, Xavier Tannier, Workshop on Semantic Big Data (SBD), next to the SIGMOD conference, 2017.
In this work, we collect spreadsheets from the insee.fr Web site, we extract the machine-readable data therein, and store it as RDF.
- Tatooine: lightweight data integration for heterogeneous data journalism. This is the source code of the paper Mixed-instance querying: a lightweight integration architecture for data journalism, Raphaël Bonaque, Tien Duc Cao, Bogdan Cautis, François Goasdoué, Javier Letelier, Ioana Manolescu, Oscar Mendoza, Swen Ribeiro, Xavier Tannier, Michaël Thomazo VLDB, Sep 2016, New Delhi, India. VLDB, <http://vldb2016.persistent.com/>.
Tatooine is an integration execution engine, capable of evaluating queries over JSON, relational, and text data. Please contact us if you would like to use the software.
- ConnectionLens: Connecting the dots across heterogeneous data sources. This is the source code of the paper ConnectionLens: Finding Connections Across Heterogeneous Data Sources, Camille Chanial, Rédouane Dziri, Helena Galhardas, Julien Leblay, Minh-Huong Le Nguyen, Ioana ManolescuProceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2018, 11, pp. 2030-2033. 〈10.14778/3229863.3236252〉 (also informally presented at Bases de Données Avancées 2018)