Team : BD - Database
Axes : AID, ASN.
Team leader : Bernd Amann, Campus Pierre et Marie Curie 25-26/514
Short presentation
The LIP6 Database research team has long-standing experience in large-scale data processing research. Its activities cover a variety of big data processing issues. The main challenges of the last few years concern the acquisition and indexing of web archives, distributed transaction processing, information filtering and recommendation in social media, JSON schema extraction, graph data processing, and data enrichment.
Keywords : big data, data streams, distributed data, data filtering and recommendation, semantic web, social media, JSON, RDF, Apache Spark
Selected publications
- Y. Bai, C. Constantin, H. Naacke : “Leiden-Fusion Partitioning Method for Effective Distributed Training of Graph Embeddings” European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), Vilnius, Lithuania[Bai 2024]
- H. Rahimi, J. Hoover, D. Mimno, H. Naacke, C. Constantin, B. Amann : “Contextualized Topic Coherence Metrics” Findings of the Association for Computational Linguistics: EACL 2024, St. Julian's, Malta, pp. 1760-1773, (Association for Computational Linguistics)[Rahimi 2024b]
- H. Rahimi, H. Naacke, C. Constantin, B. Amann : “ATEM: A Topic Evolution Model for the Detection of Emerging Topics in Scientific Archives” Studies in Computational Intelligence, vol. 1143, Menton, France, pp. 332-343, (Springer Nature Switzerland), (ISBN: 978-3-031-53472-0)[Rahimi 2024a]
- S. Jarrad, H. Naacke, S. Gançarski, M. Gueye : “Embedding-Enhanced Similarity Metrics for Next POI Recommendation” Proceedings of the 12th International Conference on Data Science, Technology and Applications DATA, vol. 1, Rome, Italy, pp. 247-254, (SciTePress - Science and Technology Publications), (ISBN: 978-989-758-664-4)[Jarrad 2023]
- M.‑A. Baazizi, D. Colazzo, G. Ghelli, C. Sartiani, S. Scherzinger : “Negation-closure for JSON Schema” Theoretical Computer Science, vol. 955, pp. 113823, (Elsevier)[Baazizi 2023]
- E. Simon, B. Amann, R. Liu, S. Gançarski : “Controlling the Correctness of Aggregation Operations During Sessions of Interactive Analytic Queries” Journal of Data and Information Quality, (ACM)[Simon 2023]
- L. Attouche, M.‑A. Baazizi, D. Colazzo, G. Ghelli, C. Sartiani, S. Scherzinger : “Witness Generation for JSON Schema” Proceedings of the VLDB Endowment (PVLDB), vol. 15 (13), pp. 4002-4014, (VLDB Endowment)[Attouche 2022]
- Q. Grossetti, C. Du Mouza, N. Travers, C. Constantin : “Reducing the filter bubble effect on Twitter by considering communities for recommendations” International Journal of Web Information Systems, vol. 17 (6), pp. 728-752, (Emerald Publishing Limited)[Grossetti 2021]
- K. Li, H. Naacke, B. Amann : “An Analytic Graph Data Model and Query Language for Exploring the Evolution of Science” Big Data Research, vol. 26, pp. 100247, (Elsevier)[Li 2021]
- M.‑A. Baazizi, D. Colazzo, G. Ghelli, C. Sartiani, S. Scherzinger : “An Empirical Study on the "Usage of Not" in Real-World JSON Schema Documents” 40th International Conference on Conceptual Modeling ER 2021, vol. 13011, Lecture Notes in Computer Science, St. John's, NL (Virtual), Canada, pp. 102-112, (Springer International Publishing), (ISBN: 978-3-030-89021-6)[Baazizi 2021]
- F.‑Z. Hannou, B. Amann, M.‑A. Baazizi : “Explaining Query Answer Completeness and Correctness with Partition Patterns” 30th International Conference on Database and Expert Systems Applications - DEXA 2019, vol. 11707, Lecture Notes in Computer Science, Linz, Austria, pp. 47-62[Hannou 2019b]
- M.‑A. Baazizi, D. Colazzo, G. Ghelli, C. Sartiani : “Schemas and Types for JSON Data: From Theory to Practice” SIGMOD '19 Proceedings of the 2019 International Conference on Management of Data, Amsterdam, Netherlands, pp. 2060-2063, (ACM)[Baazizi 2019d]
- J.‑B. Griesner, T. Abdessalem, H. Naacke, P. Dosne : “ALGeoSPF: A Hierarchical Factorization Model for POI Recommendation” 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Barcelona, Spain, pp. 87-90[Griesner 2018a]
- B. Amann, O. Curé, H. Naacke : “Distributed SPARQL Query Processing: a Case Study with Apache Spark” chapter in NoSQL Data Models: Trends and Challenges, vol. 1, (Wiley), (ISBN: 9781119528227)[Amann 2018]
- Q. Grossetti, C. Constantin, C. Du Mouza, N. Travers : “An Homophily-based Approach for Fast Post Recommendation in Microblogging Systems” Open Proceedings, Vienna, Austria, pp. 229-240[Grossetti 2018]
- X. Ren, O. Curé, H. Naacke, G. Xiao : “BigSR: real-time expressive RDF stream reasoning on modern Big Data platforms” IEEE International Conference on Big Data, Seattle, WA, United States, pp. 811-820[Ren 2018a]
- C. Constantin, C. Du Mouza, W. Litwin, Ph. Rigaux, Th. Schwarz : “AS-Index: A Structure For String Search Using n-grams and Algebraic Signatures” Journal of Computer Science and Technology, vol. 31 (1), pp. 147-166, (Springer Verlag)[Constantin 2016a]
- C. Constantin, R. Dahimene, Q. Grossetti, C. Du Mouza : “Finding Users of Interest in Micro-blogging Systems” International Conference on Extending Database Technology, EDBT 2016, Bordeaux, France[Constantin 2016c]
- Z. Kraljevic, N. Baskiotis, B. Piwowarski, P. Gallinari : “Représentation temporelle des mots : application au clustering de micro-blogs.” Conférence en Recherche d'Informations et Applications, Toulouse, France, pp. 531-544[Kraljevic 2016]
- L. DOS SANTOS, B. Piwowarski, P. Gallinari : “Multilabel classification on heterogeneous graphs with gaussian embeddings” ECML-PKDD 2016, Riva del Garda, Italy[DOS SANTOS 2016]
- O. Curé, H. Naacke, M.‑A. Baazizi, B. Amann : “HAQWA: a Hash-based and Query Workload Aware Distributed RDF Store” The 14th International Semantic Web Conference, ISWC 2015, vol. 1486, CEUR Workshop Proceedings, Bethlehem, Pennsylvania, United States, (CEUR-WS.org)[Curé 2015a]
Contact
bernd.amann (at) lip6.fr