× You must be logged in to access this page.
× You must be logged in to access this page.
× You must be logged in to access this page.

jesus_zen_drod

Teams

Dribs

  • consider implementing a consistent syntax to be used for all future BGEs (structured, ie. XML- or JSON-based)
2 years ago

Conclusions relative to HTML parser: - works reasonably well so far, there may be some occasional data loss - too many inconsistencies in the text data to reliably re-structure the data ; human post-processing of the script output is a must.

2 years ago

splitting references to extract their Art./Abs./lit. components.

2 years ago

data pulled from bger.li is unstructured ; parser tries to address this issue

2 years ago

dedicated HTML parser for bger.li

2 years ago

dedicated HTML parser for bger.li

2 years ago

Certificate icon by Gregor Cresnar - CC BY 3.0