Identifiables
Created by: Quazgar
While writing the parser for tsv tables, I realized that it shares issues with the crawler:
- We designed the crawler such that it can be interrupted and called again without problems. Should be the same for the parser
- We want to prevent creation of doubles in the DB.
- It might be necessary to create Entities that are used later on. E.g. one and the same file is referred to in a table multiple times. You will want to insert the file; but only once.
This could be solved by using the concept of identifiables in both cases. A reminder: an identifiable is just an uncomplete record. However the set of properties it has should uniquely identify a record of the corresponding type.
Imported comments:
By Alexander Schlemmer on 2020-11-24T12:06:26.166Z
@henrik_indiscale We recently discussed whether the concept of identifiables could be promoted to the pylib client. I wondered whether the parser mentioned by @quazgar here is probably already using the identifiables from the standard crawler (and this issue could be closed)?
By Timm Fitschen on 2019-09-11T15:03:40.828Z
changed due date to September 18, 2019
By Timm Fitschen on 2019-09-03T06:31:22.548Z
changed due date to September 11, 2019
By Timm Fitschen on 2019-08-30T12:11:59.509Z
unassigned @henrik_indiscale
By Timm Fitschen on 2019-08-30T12:11:47.096Z
changed due date to September 04, 2019
By Timm Fitschen on 2019-08-30T12:11:35.046Z
assigned to @timm.fitschen
By Quazgar on 2019-08-13T07:45:37.541Z
assigned to @henrik_indiscale and unassigned @quazgar