10
Challenge Project

scigate

An unified search entry point into today's highly fragmented legal database landscape and a one-stop shop for legal data.

⛶  Fullscreen ↓  Download

scigate2.pngI. The project The project aims to further develop a project that was started in the Open Legal Lab 2023 and to create legal data showcase, in other words:

  • a unified search entry point into today's highly fragmented landscape of legal databases, and
  • at the same time a low-threshold, accessible one-stop-shop for legal data.

Traditionally, libraries have been the gatekeepers for access to legal data, especially legal texts, but also legal data in the broadest sense. Libraries not only made this data spatially accessible, but also added metadata that made the data itself searchable and discoverable. This role of libraries has changed significantly in recent years. Today, legal data are often made available in databases by different actors, with different access and accessibility.The current fragmentation of access to legal data affects national and international research and its visibility. The project "Gateway to Legal Data" tries to create a counterbalance. Beyond the existing and desirable diversity of data sources, a unified search entry as well as a one-stop-shop for legal data shall be created. Its architecture can be described as follows: Title II. The Challenge A running prototype can be found here: www.scigate.online. The system is in part modularized and should be further modularized. In particular, data sources should be extended, and data aggregation added while supporting more search functionality. The linchpin of scigate.online are so-called proxies, whose task is to address data sources, translate their response and homogenize as far as possible the data to allow a unified search and access via scigate.online.

  1. Part of the challenge will be to build more proxies to connect additional data sources, such as https://onlinekommentar.ch/ and other legal data sources, to the platform. This data will be harmonized as much as possible so that it can be made available via a uniformed API. In the future, this should minimize the need to write a new scraper for each legal data research project.
  2. Another part of the challenge will be to present the data as search results on the platform. The proxies currently collect three lines for each entry plus a link to display the entry. The selection of what should be displayed for each entry, how it could be displayed and what existing functionality of the source systems might be used to render the search as user-friendly as possible, could be optimized. The search could also be extended by including more facets or auto completion.
  3. Finally, the retrieved hitlists and documents could be used to provide additional functionality. They could be fed into AI to mark the most relevant passages, to have an automated summary or to answer a natural language query.

III. Resources Running prototype: www.scigate.online The different code bases can be found here:

Event finished

Edited (version 12)

25.03.2024 13:55 ~ walter_boente

Research

24.03.2024 15:12 ~ oleg

Edited (version 10)

24.03.2024 14:10 ~ oleg

Project

Joined the team

24.03.2024 13:45 ~ ChristopheK

Edited (version 9)

24.03.2024 13:27 ~ walter_boente

Joined the team

24.03.2024 13:22 ~ magdalena_gneist

Edited (version 8)

24.03.2024 13:22 ~ walter_boente

Joined the team

24.03.2024 13:19 ~ damaris_jeker

Joined the team

24.03.2024 12:28 ~ Morteza

Edited (version 6)

24.03.2024 09:08 ~ walter_boente

Edited (version 5)

24.03.2024 09:05 ~ walter_boente

Event started

Edited (version 4)

24.03.2024 08:50 ~ walter_boente

Joined the team

24.03.2024 08:49 ~ walter_boente

Challenge

 
Alle Teilnehmer*innen, Sponsor, Partner, Freiwilligen und Mitarbeiter*innen unseres Hackathons sind verpflichtet, dem Hack Code of Conduct zuzustimmen. Die Organisatoren werden diesen Kodex während der gesamten Veranstaltung durchsetzen. Wir erwarten die Zusammenarbeit aller Teilnehmer*innen, um eine sichere Umgebung für alle zu gewährleisten. Weitere Einzelheiten zum Ablauf der Veranstaltung finden Sie unter Richtlinien auf unsere Webseite.

Creative Commons LicenceDie Inhalte dieser Website stehen, sofern nicht anders angegeben, unter einer Creative Commons Attribution 4.0 International.