Releases: ELTE-DH/WebArticleCurator
v1.4.0
Logger: Add support for multiprocessing through Queue
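The general pattern behind queue-based logging across processes can be sketched with the standard library alone. This is a minimal illustration, not WebArticleCurator's actual Logger: the names `ListHandler`, `worker`, and `run_workers` are hypothetical, and the sketch assumes the usual `QueueHandler`/`QueueListener` arrangement, in which workers only enqueue records and a single listener in the parent process writes them out.

```python
import logging
import logging.handlers
import multiprocessing


class ListHandler(logging.Handler):
    """Hypothetical sink that keeps formatted messages in memory."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


def worker(queue, n):
    """Log through the shared queue instead of writing to a handler directly."""
    logger = logging.getLogger(f"worker-{n}")
    logger.setLevel(logging.INFO)
    logger.handlers.clear()
    logger.propagate = False  # keep records away from any root handlers
    logger.addHandler(logging.handlers.QueueHandler(queue))
    logger.info("hello from worker %d", n)


def run_workers(num_workers=2):
    """Start worker processes and collect their messages in the parent."""
    queue = multiprocessing.Queue()
    sink = ListHandler()
    listener = logging.handlers.QueueListener(queue, sink)
    listener.start()
    procs = [multiprocessing.Process(target=worker, args=(queue, i))
             for i in range(num_workers)]
    for proc in procs:
        proc.start()
    for proc in procs:
        proc.join()
    listener.stop()  # drains the queue up to the stop sentinel
    return sorted(sink.messages)
```

When run as a script, `run_workers()` should be called under an `if __name__ == "__main__":` guard so that it also works with the "spawn" start method. Because only the listener touches the real handler, output from different processes is not interleaved mid-line.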
v1.3.0
- Handle LocationParseError exception
- Allow changing the Logger formatting
- Add extra checks in config reading
v1.2.1
Fix support for one single-page column combined with multiple multipage columns
v1.2.0
Add get_records() function to WarcCachingDownloader
v1.1.0
Add handling of multipage articles
v1.0.4
Fix handling of cases where no source WARC is needed
v1.0.3
- Add support for multiple source WARC files (the last one takes precedence)
- Fix double URL handling
- Minor fixes and refactorings
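The "last one takes precedence" rule for multiple source files can be pictured with a small lookup sketch. It is an illustration under stated assumptions, not the project's implementation: each source is modelled as a plain dict from URL to record, and the `lookup` function is hypothetical.

```python
def lookup(url, sources):
    """Return the record for url from the last source that contains it.

    sources is a list of dicts (URL -> record), given in the order the
    source files were listed; later entries override earlier ones.
    """
    for index in reversed(sources):
        if url in index:
            return index[url]
    raise KeyError(url)


# Two hypothetical sources: the second overrides the first for "a".
sources = [{"a": "old", "b": "only-first"}, {"a": "new"}]
record = lookup("a", sources)
```

Searching from the end means a URL that appears only in an earlier source is still found, while duplicates resolve to the most recently listed source.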
v1.0.2
Handle UnicodeError and add Makefile to aid packaging