Research project STRIPS

STRIPS – A Semantic Search Toolbox for the Retrieve of Similar Patterns in Luxembourgish Documents

The project at a glance

  • Start date:
    01 Jan 2018
  • Duration in months:
    36
  • Funding:
    University of Luxembourg
  • Principal Investigator(s):
    Peter GILLES

About

The aim of STRIPS is to develop a toolbox of semantic search algorithms for Luxembourgish. We want to implement search algorithms to retrieve and to monitor, e.g., temporal patterns of named entities in Luxembourgish texts. The term semantic, hereby, does not only refer to the usage of keywords or Bag-of-Words like names or geographic identifiers, but fosters also on more complex structures like, for example, on concepts (e.g., topics or themes) and a document’s sentiment (e.g., a positive or a negative polarity of the document). The main focus of STRIPS lies in the linguistic processing of texts written in Luxembourgish (particularly stemming, use of phonetic dictionaries and tagged word list for Luxembourgish; Part-of-speech-tagged text corpus), in similarity learning aspects to allow fuzziness in search queries, and in the identification of temporal cross-dependencies inside the Luxembourgish text corpus. To validate the project, we have given heterogeneous text sources (official news items and user-contributed comments) by RTL.

Image at the top: © generated by AI (copilot)

Organisation and Partners

  • Department of Humanities
  • Faculty of Humanities, Education and Social Sciences (FHSE)
  • Institute of Luxembourgish Language and Literature

Project team

  • Peter GILLES, PI
  • Christoph SCHOMMER, Project member
  • Christoph PURSCHKE, Project member
  • Joshgun SIRAJZADE, Project member, Ministry of Education, Children and Youth (external)
  • Daniela GIERSCHEK, Project member, Ministry of Education, Children and Youth (external)
  • Elisabeth JOY, Project member, – (external)
  • Elida VAN NIEROP, Project member, – (external)
  • Rik LAMESCH, Project member, – (external)

Keywords

  • Algorithms
  • Development
  • Luxembourgish