Create setup for test_crawler
- create a minimal setup that can replace the old scifolder scenario
- create a test for synchronize that covers only the behavior specific to that function
- refactor provenance debug to use the new setup
- make sure the split_stuff test functions are not affected