223 Commits (24167257f42d88aefd90de3f45962c1d6cc99e65)
 

Author SHA1 Message Date
Natalie Pendragon 24167257f4 Add .git-blame-ignore-revs file 11 months ago
Natalie Pendragon b087fd6439 [crawl] Make logging message slightly clearer 11 months ago
Natalie Pendragon 0b11c4abfb Check for null input in new strip_control_chars function 11 months ago
Natalie Pendragon f0c4a784b4 Update default logging config to log to both console and file 11 months ago
Natalie Pendragon 43397bdda3 Reformat code with Black 11 months ago
Natalie Pendragon 5eebbbfc00 [crawl] Strip control chars from URLs in crawl logging 11 months ago
Natalie Pendragon cdba245e15 Add exclusion improvement TODO to README 11 months ago
Remco van 't Veer b012dd5cc7 Ignore link like lines in preformatted text blocks 11 months ago
Natalie Pendragon 75378967f7 Add contributors section to about page 11 months ago
Natalie Pendragon c2dd594c45 Fix the index build 11 months ago
Natalie Pendragon 1e63d8b307 Clean up todo list in README 11 months ago
Natalie Pendragon aa3fdeaefb [build_index] Flush index segments to disk periodically 11 months ago
Remco van 't Veer 2774ab2b0e Logging 11 months ago
Remco van 't Veer f3f9d41aa7 Drop unused imports 11 months ago
Natalie Pendragon b85ce1bf4a Update gusmobile clone location in pyproject.toml 11 months ago
Remco van 't Veer fe1c2054cb Include notes on updating the index 11 months ago
Remco van 't Veer be45fc4596 Describe procedure to get gus up and running 11 months ago
Remco van 't Veer c858691814 Fix missing database column indexed_at on Page 11 months ago
Natalie Pendragon f7de0f8473 [crawl] Add a few new exclusions 11 months ago
Natalie Pendragon 72c6ccbf81 [build_index] Perform prefix-based URL exclusion during index build 11 months ago
Natalie Pendragon c0210d90cf [serve] Add "jump to page" functionality to search 1 year ago
Natalie Pendragon 5d7627a3f2 [serve] Upgrade to Jetforce v0.6.0 1 year ago
Natalie Pendragon 3756e5becf [serve] Add more quotes 1 year ago
Natalie Pendragon 6ca5c355d7 [serve] Update documentation and links a bit 1 year ago
Natalie Pendragon 6ddea2105b [serve] Add dynamic quotes to footer 1 year ago
Natalie Pendragon c67268608f [serve] Add newest pages endpoint, revamp documentation and index 1 year ago
Natalie Pendragon 6df4e561eb [serve] Add newest hosts route 1 year ago
Natalie Pendragon 22145a7abd [serve] Remove extra quotation mark in add seeds template 1 year ago
Natalie Pendragon 86bf28edff [crawl] Print change_frequency 1 year ago
Natalie Pendragon 8a0c456fb9 Fix bug in GeminiResource url construction 1 year ago
Natalie Pendragon c5b0648dcc [threads] Only work with textual pages 1 year ago
Natalie Pendragon d993f6cbd0 [serve] Add favicon.txt route 1 year ago
Natalie Pendragon 6adb5336b5 [serve] Add IP addresses to about page 1 year ago
Natalie Pendragon b6ddd91524 [threads] Add different sort orders for threads 1 year ago
Natalie Pendragon 3d014404a2 [serve] Improve feed matching 1 year ago
Natalie Pendragon 2468177399 Update naming 1 year ago
Natalie Pendragon fae9d9d5fe [crawl] Improve handling of change_frequency 1 year ago
Natalie Pendragon 0b45da52c1 [serve] Add Known Feeds page 1 year ago
Natalie Pendragon 34be029c65 [threads] Add collapsible log variations 1 year ago
Natalie Pendragon a2607cd721 [threads] Fix thread ordering 1 year ago
Natalie Pendragon 6ea24fffbb [crawl] Index more errors 1 year ago
Natalie Pendragon 93722e6759 [crawl] Add change_frequency backoff 1 year ago
Natalie Pendragon 632a4cb16c Bump dependencies 1 year ago
Natalie Pendragon f75751e5b9 Add friendly authors and titles for threads 1 year ago
Natalie Pendragon 8c1399ade9 Threads v1 1 year ago
Natalie Pendragon ded0c0ca62 [serve] Save searches to db 1 year ago
Natalie Pendragon 39010248c1 [build_index] [serve] Distinguish cross-capsule backlinks 1 year ago
Natalie Pendragon c341bb82ae [crawl] Add is_cross_host_like field to db 1 year ago
Natalie Pendragon 6c187c2af2 Gitignore all the indexes 1 year ago
Natalie Pendragon 3212bff302 Bump dependencies 1 year ago