Skip to content

Commit

Permalink
Fix 24hu_rest3 config and stabilize checkurls
Browse files Browse the repository at this point in the history
  • Loading branch information
dlazesz committed Dec 13, 2021
1 parent ba2b8af commit a8da9e0
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion configs/site_schemas/24hu_rest3_schema.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,7 @@
"date_last_article": 2005-05-31

"elet-stilus2":
"archive_url_format": "https://24.hu/elet-stilus/#year/#month/"
"archive_url_format": "https://24.hu/elet-stilus/#year/#month/#day/"

"date_first_article": 2005-06-01
"date_last_article": 2010-12-31
Expand Down
2 changes: 1 addition & 1 deletion webarticlecurator/other_modes.py
Original file line number Diff line number Diff line change
Expand Up @@ -85,7 +85,7 @@ def archive_page_contains_article_url(extract_article_urls_from_page_plus_fun, s

url_to_fname = {}
archive_page_for_checked_urls = defaultdict(set)
for url in w.url_index:
for url in sorted(w.url_index):
raw_html = w.download_url(url)
if raw_html is not None:
article_urls_w_meta = extract_article_urls_from_page_plus_fun(raw_html)
Expand Down

0 comments on commit a8da9e0

Please sign in to comment.