CaosDB Crawler
!104

Create a new scanner module and move functions from crawl module there

f-refactor-scanner-crawler into dev

Overview 37
Commits 49
Pipelines 30
Changes 24

Merged Alexander Schlemmer requested to merge f-refactor-scanner-crawler into dev 2 years ago

Summary

Refactoring of the crawler and scanner:

I split the crawler into two parts:
- The scanner module walks through the file system and other structure elements and collects all the information into record types.
- The crawler module (crawl.py) does everything else that needs a CaosDB interaction.
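
The split described above can be pictured with a minimal sketch. This is not the actual CaosDB Crawler code: `ScannedRecord`, `scan_tree`, `SyncCrawler`, and the dict used as a store are hypothetical names chosen for illustration. The point is only that the scanning step needs no server connection, while the synchronization step owns everything that would.

```python
import os
import tempfile
from dataclasses import dataclass, field


@dataclass
class ScannedRecord:
    """Hypothetical container for what the scanner finds (not a CaosDB class)."""
    path: str
    properties: dict = field(default_factory=dict)


def scan_tree(root):
    """Scanner side: walk the file system and collect records, no server access."""
    records = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in sorted(filenames):
            full_path = os.path.join(dirpath, filename)
            properties = {"name": filename, "size": os.path.getsize(full_path)}
            records.append(ScannedRecord(path=full_path, properties=properties))
    return records


class SyncCrawler:
    """Crawler side: everything that would need a server interaction.

    A plain dict stands in for the server here (an assumption for the sketch).
    """

    def __init__(self, store):
        self.store = store

    def synchronize(self, records):
        """Insert records that are not yet in the store; return how many were new."""
        inserted = 0
        for record in records:
            if record.path not in self.store:
                self.store[record.path] = record
                inserted += 1
        return inserted


def demo():
    """Scan a temporary directory once, then synchronize the result twice."""
    with tempfile.TemporaryDirectory() as tmp:
        for name in ("a.txt", "b.txt"):
            with open(os.path.join(tmp, name), "w") as handle:
                handle.write("data")
        records = scan_tree(tmp)
        crawler = SyncCrawler(store={})
        return crawler.synchronize(records), crawler.synchronize(records)
```

In this sketch the scan result can be inspected or tested without any backend, and a second synchronization of the same records inserts nothing new.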

The main steps in refactoring were:

- Extraction of the functions related only to the scanning process.
- Slight renaming and consistency checks.
- Adapting all the tests to the new structure.

Left TODO:

- Fixing the integration tests

Focus

The best procedure for the review probably is to go through the individual commits. I tried to keep them as fine-grained as possible. Fixing the tests actually was very repetitive, because the API was changed slightly.

One thing might not be solved ideally:

Setting of the runid and the crawled_directory attributes. These are member variables that were previously set by functions now contained in the independent scanner module. This might need some cross-checking with @henrik who probably introduced these variables for logging.
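
As a rough illustration of the concern above: once scanning lives in an independent module, the crawler side has to set these values explicitly. The sketch below assumes a UUID-based run id and uses hypothetical names (`generate_run_id`, `CrawlContext`, `start_run`); the real attribute handling in crawl.py may differ.

```python
import uuid
from dataclasses import dataclass
from typing import Optional


def generate_run_id() -> uuid.UUID:
    """Create a fresh identifier for one crawler run (UUID4 is an assumption)."""
    return uuid.uuid4()


@dataclass
class CrawlContext:
    """Hypothetical holder for the two attributes mentioned in the text."""
    crawled_directory: Optional[str] = None
    run_id: Optional[uuid.UUID] = None


def start_run(context: CrawlContext, directory: str) -> CrawlContext:
    """Set both attributes explicitly, since the scanner no longer does it."""
    context.crawled_directory = directory
    context.run_id = generate_run_id()
    return context
```

Setting both values in one place keeps log messages of a run attributable to a single id and directory, which is the use case the text attributes to these variables.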

Test Environment

Unit tests

I did not run the integration tests manually, but the pipeline probably runs them.

Check List for the Author

Please prepare your MR for a review. Be sure to write a summary and a focus and create GitLab comments for the reviewer. They should guide the reviewer through the changes, explain your changes and also point out open questions. For further good practices have a look at our review guidelines.

- All automated tests pass
- Reference related issues
- Up-to-date CHANGELOG.md (or not necessary)
- Up-to-date JSON schema (or not necessary)
- Appropriate user and developer documentation (or not necessary)
  - How do I use the software? Assume "stupid" users.
  - How do I develop or debug the software? Assume novice developers.
- Annotations in code (GitLab comments)
  - Intent of new code
  - Problems with old code
  - Why this implementation?

Check List for the Reviewer

- I understand the intent of this MR
- All automated tests pass
- Up-to-date CHANGELOG.md (or not necessary)
- Appropriate user and developer documentation (or not necessary)
- The test environment setup works and the intended behavior is reproducible in the test environment
- In-code documentation and comments are up-to-date.
- Check: Are there specifications? Are they satisfied?

For further good practices have a look at our review guidelines.

Edited 2 years ago by Florian Spreckelsen

Activity

Alexander Schlemmer requested review from @florian 2 years ago
Alexander Schlemmer assigned to @salexan 2 years ago
Alexander Schlemmer added 7 commits 2 years ago

8cc9c99a - MAIN: renamed load_converters function and removed references to self

f547fa39 - MAINT: made utility and converter registry functions top level functions without references to self

31a6b372 - MAINT: moved main scanner function to scanner module

90620c94 - MAIN: refactored scan_structure_elements and scan_directory functions

40f3cc5f - MAIN: changed name and name of a parameter of main scanner function

9fd76b44 - MAINT: refactored some names in main scanner function

47ea54d8 - MAINT: reintroduced the converters path needed for the debug tree

Alexander Schlemmer added 1 commit 2 years ago

50a18727 - MAINT: moved debug tree from crawl.py to scanner.py and created a new class in module debug tree

Alexander Schlemmer added 1 commit 2 years ago

f1636f29 - MAINT: removed references to self

Alexander Schlemmer added 2 commits 2 years ago

bd146bd3 - MAINT: removed old synchronization function and refactored init method

1aeaa359 - MAINT: finished refactoring of main crawler functions

Alexander Schlemmer added 1 commit 2 years ago

ef8d6f66 - MAINT: finished refactoring of crawler module

Alexander Schlemmer added 1 commit 2 years ago

7e111959 - TST: started refactoring the tests

Alexander Schlemmer added 1 commit 2 years ago

ee67ec55 - TST: completed refactoring of tests

Alexander Schlemmer marked this merge request as ready 2 years ago
Alexander Schlemmer changed the description 2 years ago
Alexander Schlemmer added 1 commit 2 years ago

73b06ec0 - DOC: udpated changelog

Alexander Schlemmer added 1 commit 2 years ago

bae8cfa2 - TST: Fixed basic integration tests

Alexander Schlemmer added 2 commits 2 years ago

2080b532 - TST: fixed integration test realworld example

73d87184 - FIX: minor typo fixed

Alexander Schlemmer added 1 commit 2 years ago

31614d82 - TST: another minor fix in integration tests

Alexander Schlemmer added 1 commit 2 years ago

dbeea36b - TST: more small fixes for the integration tests

Alexander Schlemmer added 1 commit 2 years ago

a49b4404 - TST: another small fix in integration tests

Alexander Schlemmer marked this merge request as draft 2 years ago
Alexander Schlemmer changed the description 2 years ago
Alexander Schlemmer added 1 commit 2 years ago

e3bc51fc - FIX: fixed test_usses by introducing a function in the crawl module to generate a run id manually
