X-Git-Url: https://git.xandkar.net/?a=blobdiff_plain;f=TODO;h=2d4efd99a53d512a1cc0833108802bbeb902b817;hb=d3ac9e11ef7903d22b29241574b106a434f8ddd0;hp=11d77ee7b7d477c50a0345cdb8776ab4a3e7acee;hpb=a60c484e9105bad3ec0bd90148b27f44dbfeb266;p=tt.git diff --git a/TODO b/TODO index 11d77ee..2d4efd9 100644 --- a/TODO +++ b/TODO @@ -10,7 +10,13 @@ Legend: In-progress ----------- - +- [-] timeline limits + - [x] by time range + - [ ] by msg count + - [ ] per peer + - [ ] total + Not necessary for short format, because we have Unix head/tail, + but may be convinient for long format (because msg spans multiple lines). - [-] Convert to Typed Racket - [x] build executable (otherwise too-slow) - [-] add signatures @@ -100,6 +106,20 @@ In-progress Backlog ------- +- [ ] Support date without time in timestamps +- [ ] Associate cached object with nick. +- [ ] Crawl downloaded web access logs +- [ ] download-command hook to grab the access logs + + (define (parse log-line) + (match (regexp-match #px"([^/]+)/([^ ]+) +\\(\\+([a-z]+://[^;]+); *@([^\\)]+)\\)" log-line) + [(list _ client version uri nick) (cons nick uri)] + [_ #f])) + + (list->set (filter-map parse (file->lines "logs/combined-access.log"))) + + (filter (λ (p) (equal? 'file (file-or-directory-type p))) (directory-list logs-dir)) + - [ ] user-agent file as CLI option - need to run at least the crawler as another user - [ ] Support fetching rsync URIs - [ ] Check for peer duplicates: @@ -122,10 +142,8 @@ Backlog - [ ] download times per peer - [ ] Support redirects - should permanent redirects update the peer ref somehow? -- [ ] Support time ranges (i.e. reading the timeline between given time points) - [ ] optional text wrap - [ ] write -- [ ] timeline limits - [ ] peer refs set operations (perhaps better done externally?) - [ ] timeline as a result of a query (peer ref set op + filter expressions) - [ ] config files @@ -154,6 +172,9 @@ Backlog Done ---- +- [x] Crawl all cache/objects/*, not given peers. +- [x] Support time ranges (i.e. reading the timeline between given time points) +- [x] Dedup read-in peers before using them. - [x] Prevent redundant downloads - [x] Check ETag - [x] Check Last-Modified if no ETag was provided