Équipe : BD - Bases de Données
Axes : AID (👥👥), ASN (👥👥).Responsable :
Bernd Amann Campus Pierre et Marie Curie 25-26/514
Aucune manisfestation prévue actuellement.
Brève présentation
L'équipe de recherche Base de Données du LIP6 a une longue expérience de la recherche en traitement de données à large échelle. Les recherches couvrent une variété de problèmes dans l'acquisition, la gestion et l'interrogation de données complexes et volumineuses. Les principaux défis des dernières années concernent l'acquisition et l'indexation d'archives Web, le traitement des transactions distribuées, le filtrage et la recommandation dans les médias sociaux, l'extraction de schémas Json, le stockage et l'interrogation de graphes et l'enrichissement de données et de schémas.
Gestion de données à large échelle, archivage web, flux de données, données distribuées, filtrrage et recommandation, web sémantique, médias sociaux, JSON, RDF, Apache Spark
Sélection de publications
- Y. Bai, C. Constantin, H. Naacke : “Leiden-Fusion Partitioning Method for Effective Distributed Training of Graph Embeddings” European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), Vilnius (Lituanie), Lithuania[Bai 2024]
- H. Rahimi, J. Hoover, D. Mimno, H. Naacke, C. Constantin, B. Amann : “Contextualized Topic Coherence Metrics” Findings of the Association for Computational Linguistics: EACL 2024, St. Julian's, Malta, pp. 1760-1773, (Association for Computational Linguistics)[Rahimi 2024b]
- H. Rahimi, H. Naacke, C. Constantin, B. Amann : “ATEM: A Topic Evolution Model for the Detection of Emerging Topics in Scientific Archives” Studies in Computational Intelligence, vol. 1143, Studies in Computational Intelligence, Menton, France, pp. 332-343, (Springer Nature Switzerland), (ISBN: 978-3-031-53472-0)[Rahimi 2024a]
- S. Jarrad, H. Naacke, S. Gançarski, M. Gueye : “Embedding-Enhanced Similarity Metrics for Next POI Recommendation” Proceedings of the 12th International Conference on Data Science, Technology and Applications DATA, vol. 1, Rome, Italy, pp. 247-254, (SciTePress - Science and Technology Publications), (ISBN: 978-989-758-664-4)[Jarrad 2023]
- M.‑A. Baazizi, D. Colazzo, G. Ghelli, C. Sartiani, S. Scherzinger : “Negation-closure for JSON Schema” Theoretical Computer Science, vol. 955, pp. 113823, (Elsevier)[Baazizi 2023]
- E. Simon, B. Amann, R. Liu, S. Gançarski : “Controlling the Correctness of Aggregation Operations During Sessions of Interactive Analytic Queries” Journal of data and information quality, (ACM)[Simon 2023]
- L. Attouche, M.‑A. Baazizi, D. Colazzo, G. Ghelli, C. Sartiani, S. Scherzinger : “Witness Generation for JSON Schema” Proceedings of the VLDB Endowment (PVLDB), vol. 15 (13), pp. 4002-4014, (VLDB Endowment)[Attouche 2022]
- Q. Grossetti, C. Du Mouza, N. Travers, C. Constantin : “Reducing the filter bubble effect on Twitter by considering communities for recommendations” International Journal of Web Information Systems, vol. 17 (6), pp. 728-752, (Emerald Publishing Limited)[Grossetti 2021]
- K. Li, H. Naacke, B. Amann : “An Analytic Graph Data Model and Query Language for Exploring the Evolution of Science” Big Data Research, vol. 26, pp. 100247, (Elsevier)[Li 2021]
- M.‑A. Baazizi, D. Colazzo, G. Ghelli, C. Sartiani, S. Scherzinger : “An Empirical Study on the "Usage of Not" in Real-World JSON Schema Documents” 40th International Conference on Conceptual Modeling ER 2021, vol. 13011, Lecture Notes in Computer Science, St. John's, NL (Virtual), Canada, pp. 102-112, (Springer International Publishing), (ISBN: 978-3-030-89021-6)[Baazizi 2021]
- F.‑Z. Hannou, B. Amann, M.‑A. Baazizi : “Explaining Query Answer Completeness and Correctness with Partition Patterns” 30th International Conference on Database and Expert Systems Applications - DEXA 2019, vol. 11707, Lecture Notes in Computer Science, Linz, Austria, pp. 47-62[Hannou 2019b]
- M.‑A. Baazizi, D. Colazzo, G. Ghelli, C. Sartiani : “Schemas and Types for JSON Data: From Theory to Practice” SIGMOD '19 Proceedings of the 2019 International Conference on Management of Data, Amsterdam, Netherlands, pp. 2060-2063, (ACM)[Baazizi 2019d]
- J.‑B. Griesner, T. Abdessalem, H. Naacke, P. Dosne : “ALGeoSPF: A Hierarchical Factorization Model for POI Recommendation” 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Barcelona, Spain, pp. 87-90[Griesner 2018a]
- B. Amann, O. Curé, H. Naacke : “Distributed SPARQL Query Processing: a Case Study with Apache Spark” chapter in NoSQL Data Models: Trends and Challenges, vol. 1, (Wiley), (ISBN: 9781119528227)[Amann 2018]
- Q. Grossetti, C. Constantin, C. Du Mouza, N. Travers : “An Homophily-based Approach for Fast Post Recommendation in Microblogging Systems” Open Proceedings, Vienne, Austria, pp. 229-240[Grossetti 2018]
- X. Ren, O. Curé, H. Naacke, G. Xiao : “BigSR: real-time expressive RDF stream reasoning on modern Big Data platforms” IEEE International Conference on Big Data, Seatle, WA, United States, pp. 811-820[Ren 2018a]
- C. Constantin, C. Du Mouza, W. Litwin, Ph. Rigaux, Th. Schwarz : “AS-Index: A Structure For String Search Using n-grams and Algebraic Signatures” Journal of Computer Science and Technology, vol. 31 (1), pp. 147-166, (Springer Verlag)[Constantin 2016a]
- C. Constantin, R. Dahimene, Q. Grossetti, C. Du Mouza : “Finding Users of Interest in Micro-blogging Systems” International Conference on Extending Database Technology, EDBT 2016, Bordeaux, France[Constantin 2016c]
- Z. Kraljevic, N. Baskiotis, B. Piwowarski, P. Gallinari : “Représentation temporelle des mots : application au clustering de micro-blogs.” Conférence en Recherche d'Infomations et Applications, Toulouse, France, pp. 531-544[Kraljevic 2016]
- L. DOS SANTOS, B. Piwowarski, P. Gallinari : “Multilabel classification on heterogeneous graphs with gaussian embeddings” ECML-PKDD 2016, Riva del garda, Italy[DOS SANTOS 2016]
- O. Curé, H. Naacke, M.‑A. Baazizi, B. Amann : “HAQWA: a Hash-based and Query Workload Aware Distributed RDF Store” The 14th International Semantic Web Conference, ISWC 2015, vol. 1486, CEUR Workshop Proceedings, Bethlehem, Pennsylvania, United States, (CEUR-WS.org)[Curé 2015a]
Contact
bernd.amann (at) nulllip6.fr