Krptkn (Кропоткин Framework v0.1.0) View Source

Krptkn

Krptkn is a spider, metadata extractor, metadata analyzer and report generator.

Roadmap

  • [x] Spider
    • [x] Extract URLs from HTML
    • [x] Dictionary of common directories
    • [x] Extract from Robots.txt and other files
  • [x] NIF with libextractor
  • [ ] Metadata analyzer
  • [x] Metadata filter
  • [ ] Report generator
  • [ ] Meta
    • [x] Create tests
    • [x] Continuous integration
    • [x] Module documentation
    • [ ] Function documentation
    • [x] Schematics

Schematics

Krptkn's data flow:

alt text