Comments - pootriarch - Sprites kbin instance

This profile is from a federated server and may be incomplete. Browse more on the original instance.

pootriarch, 1 year ago to piracy in The New York Times tried to block the Internet Archive: another reason to value the latter

It exists, it’s called a robots.txt file that the developers can put into place, and then bots like the webarchive crawler will ignore the content.

the internet archive doesn’t respect robots.txt:

Over time we have observed that the robots.txt files that are geared toward search engine crawlers do not necessarily serve our archival purposes.

the only way to stay out of the internet archive is to follow the process they created and hope they agree to remove you. or firewall them.

blog.archive.org/…/robots-txt-meant-for-search-en…

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

pootriarch, 2 years ago to privacyguides in Logseq: A privacy-first, open-source platform for knowledge management.

i’d never heard of this concept! i have a disorganized stack of markdown files - notes, to-do and packing lists - that this looks ideal to tame

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...