Salta al contenuto principale



The Open-Source Software Saving the Internet From AI Bot Scrapers

404media.co/the-open-source-so…

#news #tech #technology #security #privacy #AI #AISlop


The Open-Source Software Saving the Internet From AI Bot Scrapers


For someone who says she is fighting AI bot scrapers just in her free time, Xe Iaso seems to be putting up an impressive fight. Since she launched it in January, Anubis, a “program is designed to help protect the small internet from the endless storm of requests that flood in from AI companies,” has been downloaded nearly 200,000 times, and is being used by notable organizations including GNOME, the popular open-source desktop environment for Linux, FFmpeg, the open-source software project for handling video and other media, and UNESCO, the United Nations organization for educations, science, and culture.

Iaso decided to develop Anubis after discovering that her own Git server was struggling with AI scrapers, bots that crawl the web hoovering up anything that can be used for the training data that power AI models. Like many libraries, archives, and other small organizations, Iaso discovered her Git server was getting slammed only when it stopped working.

“I wasn't able to load it in my browser. I thought, huh, that's strange,” Iaso told me on a call. “So I looked at the logs and I figured out that it's restarted about 500 times in the last two days. So I looked in the access logs and I saw that [an] Amazon [bot] was clicking on every single link.”

Iaso knew it was an Amazon bot because it self identified as such. She said she considered withdrawing the Git server from the open web but that because she wants to keep some of the source code hosted there open to the public, she tried to stop the Amazon bot instead.

“I tried some things that I can’t admit in a recorded environment. None of them worked. So I had a bad idea,” she said. “I implemented some code. I put it up on GitHub in an experimental project dumping ground, and then the GNOME desktop environment started using it as a Hail Mary. And that's about when I knew that I had something on my hands.”

There are several ways people and organizations are trying to stop bots at the moment. Historically, robots.txt, a file sites could use to tell automated tools not to scrape, was a respected and sufficient norm for this purpose, but since the generative AI boom, major AI companies as well as less established companies and even individuals, often ignored it. CAPTCHAs, the little tests users take to prove they’re not a robot, aren’t great, Iaso said, because some AI bot scrapers have CAPTCHA solvers built in. Some developers have created “infinite mazes” that send AI bot scrapers from useless link to useless link, diverting them from the actual sites humans use and wasting their time. Cloudflare, the ubiquitous internet infrastructure company, has created a similar “AI labyrinth” feature to trap bots.

Iaso, who said she deals with some generative AI at her day job, told me that “from what I have learned, poisoning datasets doesn't work. It makes you feel good, but it ends up using more compute than you end up saving. I don't know the polite way to say this, but if you piss in an ocean, the ocean does not turn into piss.”

In other words, Iaso thinks that it might be fun to mess with the AI bots that are trying to mess with the internet, but in many cases it’s not practical to send them on these wild goose chases because it requires resources Cloudflare might have, but small organizations and individuals don’t.

“Anubis is an uncaptcha,” Iaso explains on her site. “It uses features of your browser to automate a lot of the work that a CAPTCHA would, and right now the main implementation is by having it run a bunch of cryptographic math with JavaScript to prove that you can run JavaScript in a way that can be validated on the server.”

Essentially, Anubis verifies that any visitor to a site is a human using a browser as opposed to a bot. One of the ways it does this is by making the browser do a type of cryptographic math with JavaScript or other subtle checks that browsers do by default but bots have to be explicitly programmed to do. This check is invisible to the user, and most browsers since 2022 are able to complete this test. In theory, bot scrapers could pretend to be users with browsers as well, but the additional computational cost of doing so on the scale of scraping the entire internet would be huge. This way, Anubis creates a computational cost that is prohibitively expensive for AI scrapers that are hitting millions and millions of sites, but marginal for an individual user who is just using the internet like a human.

Anubis is free, open source, lightweight, can be self-hosted, and can be implemented almost anywhere. It also appears to be a pretty good solution for what we’ve repeatedly reported is a widespread problem across the internet, which helps explain its popularity. But Iaso is still putting a lot of work into improving it and adding features. She told me she’s working on a non cryptographic challenge so it taxes users’ CPUs less, and also thinking about a version that doesn’t require JavaScript, which some privacy-minded disable in their browsers.

The biggest challenge in developing Anubis, Iaso said, is finding the balance.

“The balance between figuring out how to block things without people being blocked, without affecting too many people with false positives,” she said. “And also making sure that the people running the bots can't figure out what pattern they're hitting, while also letting people that are caught in the web be able to figure out what pattern they're hitting, so that they can contact the organization and get help. So that's like, you know, the standard, impossible scenario.”

Iaso has a Patreon and is also supported by sponsors on Github who use Anubis, but she said she still doesn’t have enough financial support to develop it full time. She said that if she had the funding, she’d also hire one of the main contributors to the project. Ultimately, Anubis will always need more work because it is a never ending cat and mouse game between AI bot scrapers and the people trying to stop them.

Iaso said she thinks AI companies follow her work, and that if they really want to stop her and Anubis they just need to distract her.

“If you are working at an AI company, here's how you can sabotage Anubis development as easily and quickly as possible,” she wrote on her site. “So first is quit your job, second is work for Square Enix, and third is make absolute banger stuff for Final Fantasy XIV. That’s how you can sabotage this the best.”




Saviano e Galimberti alla 20°edizione di Con_Vivere
Il Festival della Fondazione Cassa di Risparmio di Carrara è stato presentato al Consiglio Regionale: per l'occasione sono stati rivelati i nomi dei protagonisti e la locandina dell'evento

noitv.it/2025/07/saviano-e-gal…












Thank you again to @brothersoul and @DJUpNorth for talking to me about @labr . Please check out the video or audio versions below.

#Peertube #VOD - video.firesidefedi.live/w/u3y3…
#Castopod #Fedicast - audio.firesidefedi.live/@fires…
#Youtube - youtube.com/channel/UCaJ15PXgR

All #Links - firesidefedi.live

#stream #owncast #live #interview #firesideFedi #FsF #people #peopleOverPlatforms #protocolsOverPlatforms #fedi #fediverse #open #internet #openInternet #podcast #fedicast #livestream #show #episode #peertube #vod #castopod #writefreely #lemmy #boostplease #fedizen #btfree #bigTechFree #nonprofit #signup #tubeFree

If you're enjoying the show, please consider supporting our new nonprofit btfree.org at givebutter.com/btfree. We're currently running tubefree.org which is a moderated peertube open for signups right now!

FediThing 🏳️‍🌈 reshared this.



Úvod do Fediverse: Moderní podoby sociální sítě


Toto video je barvitým úvodem do sociální sítě Fediverse, natočené režisérkou a propagátorkou Fediverse Elenou Rossini. Objevte nový svět sociálních médií, kde je respektováno Vaše soukromí, klíčoví jsou uživatelé a velké technologické společnosti nemají žádný vliv.

Autor videa: Elena Rossini a tým
Produkce: Jan Dytrych
Dabing: Zloběna
Časování audia: Schmaker
Skript: Jann

Questa voce è stata modificata (2 mesi fa)


Midnight blackout: Iran’s internet went down across most of the country in a massive multi-ISP outage. #IranInternetStatus #keepiton #InternetShutdown


LOL. MAGATs love them some AI! So fucking LAZY.

Lawyers for MyPillow CEO Fined for Using AI to Write Court Filing
thedailybeast.com/lawyers-for-…

#MikeLindell #MAGA #MyNazi #AI #USPol



#gazetadobrasil #jornalismo #noticias #politics Senado pode aprovar legalização de cassinos, bingos, jogo do bicho e corridas de cavalos em todo o país gazetabrasil.com.br/politica/2…


"America is a dumpster fire"
Seen in Bellingham, Washington


Mais quand on dit qu'il faudrait envisager l'idée que le numérique n'est pas éternel, et que ce serait bien de ne pas mettre tous nos oeufs dans le même panier, on se fait regarder comme de dangereux Amish adeptes de cavernes éclairées à la graisse de mammouth.

_____
Semiconductor industry could short out as copper runs dry • The Register
theregister.com/2025/07/08/cop…



Hmm, I need to go get a package at the door but an entire flock of quail is outside. 🤔#wildlife
Unknown parent

mastodon - Collegamento all'originale
AI6YR Ben
@Beedazzled Charming, yet not very intelligent. LOL. They do make a pleasant cooing noise walking around.




Several years ago, I came across this image and I saved it... As I lay here, some tears in my eyes, wishing my gall a good night... I remembered I had this image. I don't know where it came from, but it's so very true...
Arwen, have a good night sweets, I miss you so much... 💔

Hopefully @altbot can help me with the text, as it's hard to type when crying...

in reply to Pixy's Journey

The image depicts a woman standing with her back to the viewer, gazing at a starry night sky. She is wearing a blue long-sleeve shirt, green pants, and brown shoes. In her right hand, she holds a red dog leash. Beside her, a translucent, ethereal dog stands, appearing to be a ghostly representation of a pet. The dog is looking up at the woman, and its body is illuminated with a soft, glowing light. The background features a dark sky filled with stars, transitioning from deep blue to purple hues near the horizon, suggesting a twilight or nighttime setting. The landscape includes silhouettes of trees and a hill, adding depth to the scene. The image conveys a sense of longing and remembrance, as suggested by the text overlay at the top, which reads: "Someone asked me 'What's the most difficult thing about owning a Dog?' ...i replied 'the Goodbye'."

Provided by @altbot, generated privately and locally using Ovis2-8B

🌱 Energy used: 0.216 Wh



This Tuesday and Wednesday Linux install parties in #Netherlands and #Canada (all times local)!

* Tues. 8 July -- Fablab #Amersfoort (#Utrecht) 19h-22h

fablabamersfoort.nl/2025/05/07…

* Wed. 9 July -- #RichmondBC Public Library (#Vancouver) 17h30-19h30

meetup.com/vanlug-bc/events/30…

For details and more events worldwide: endof10.org/events/

#EndOf10 #FreeSoftware #OpenSource #FOSS #Linux #GNULinux #Windows #Windows10 #Windows11

reshared this





Jukebox - Open source alternative to Spotify's Collaborative Playlists

🕵️ Anonymous accounts: no sign up or email needed

✨Share a link, add songs together

🚀No app download or login required

⭐ 100% free no ads

jukeboxhq.com/

Questa voce è stata modificata (2 mesi fa)

reshared this




youtu.be/9HAStL8P_gQ
⚡️🇺🇦Chinese companies openly begin supplying Russian military as West looks on impotently (War and Politics 24 - Ukrainian VIDEO) #Ukraine #NukesForUkraine #Germany #France #Italy #OSCE #PACE #CoE #SouthKorea #Press #News #Taiwan #Media #Japan #USA #US #UK #EU #NATO #UnitedStates #UnitedKingdom
#EuropeanUnion #russiaUkraineWar
#11yrInvasionofUkraine
#RussiaIsATerroristState #TrumpIsARussianAsset

Gazzetta del Cadavere reshared this.




#Amarok 3.3 Open-Source Music Player Is Out as First Release Fully Ported to Qt 6 9to5linux.com/amarok-3-3-open-…

@kde #Linux #OpenSource #FreeSoftware

in reply to 9to5Linux

Looks a lot like 2000s iTunes tbh. When iPod was a thing.


Onion Services Design in depth
media.ccc.de/v/gpn23-52-onion-…
watch?v=Iho4Q


Browsing through my old photos and stumbled upon this one of Finn from when he was a puppy. You're welcome 😀 #puppy #dog


Video Game History Foundation Library – Digital Archive

The VGHF Digital Archive is the portal for our digitally preserved content. You can directly access our digital collections and search through the full text of documents, magazines, transcripts, and more.

This library is a permanent work in progress. Not all materials are currently cataloged or digitized, and our library system may change in the future. For a more complete list of VGHF’s holdings, visit the Library Catalog.

welchwrite.com/blog/2025/07/08…

#game #gaming #history #archive #research #shared

in reply to Douglas E. Welch

The image displays a row of six white rectangular boxes, each representing a different collection. The first box on the left features an image of a sketch and is labeled "Craig Stitt art and design papers." The second box shows a digital screen with a person and is labeled "Cyan collection." The third box displays a colorful CD cover and is labeled "GamePro press CD collection." The fourth box shows a book cover with a dark, ominous design and is labeled "FromSoftware promotional material collection." The fifth box features a book cover with a desert landscape and is labeled "Electronic Entertainment Expo (E3) directories." The sixth box shows a magazine cover with various articles and is labeled "Magazine Library." Each box has a blue icon in the top left corner, and the text is in blue font.

Provided by @altbot, generated privately and locally using Ovis2-8B

🌱 Energy used: 0.164 Wh




globalist.it/culture/2025/07/0…

Interessante



youtu.be/CJIkf0utrw4


ICE is attacking immigrants for now, but their goal is to subjugate all of us. Fighting for our neighbors today is a way of fighting for ourselves tomorrow.

Map the infrastructure that ICE depends on. Publicize their vulnerabilities. Popularize simple, reproducible ways to impose consequences every time that ICE inflicts harm on a community. Don't just react to their attacks—choose the time and place of confrontations. Take the initiative.

crimethinc.com/zines/seven-ste…

"If we know, and do nothing, we are worse than the murderers hired in our name.

"If we know, then we must fight for your life as though it were our own—which it is—and render impassable with our bodies the corridor to the gas chamber. For, if they take you in the morning, they will be coming for us that night."

-James Baldwin, writing to Angela Davis while she was in captivity, November 19, 1970

reshared this

in reply to CrimethInc. Ex-Workers

We're at approximately 1933 nazi Germany socially.
Already have the jack booted thugs.


Die Sache mit den Zuchtlinien bei Hühnern ist insgesamt ziemlich spannend.
Das System hat sich in den 1980er Jahren global durchgesetzt, weil es Hühnerfleisch konkurrenzlos billig machte.

Außerdem hat es gleich erstmal ne Pandemie bei Menschen ausgelöst. Die Älteren erinnern sich vielleicht...

spektrum.de/video/salmonellen-…



Marktbericht: US-Börsen kommen nicht in Gang

Die Unsicherheit über die Zollpolitik der Regierung hat die Wall-Street-Anleger heute gebremst. Der DAX setzte seine Klettertour in Richtung Rekordhoch hingegen fort.

➡️ tagesschau.de/wirtschaft/finan…

#Marktbericht



Trump pulls plug on UK research into air pollution and global warming


Scientists decry President's 'vandalism' of climate science as they lose access to data and collaborators


Archived version: archive.is/20250708140933/inew…


Disclaimer: The article linked is from a single source with a single perspective. Make sure to cross-check information against multiple sources to get a comprehensive view on the situation.


in reply to Matthias

@Matthias Ich habe es aufgegeben, dieser Mastodon Zirkus wird nie aufhören. Wenn dann bald noch die Zitatefunktion mit 4.5 final ausgerollt wird, dann brennt wieder die Hütte. Bin froh, dass die Instanz meines Musikaccounts in der Hinsicht völlig entspannt ist. Außerdem haben wir dort passende 666 Zeichen. 🤘🏻

-----
"Ich denke, also bin ich hier falsch." (frei nach René Descartes)
in reply to ⚝ Mirk0 ⚝ on Friendica (closing 17/07/2025)

@⚝ Mirk0 ⚝ on Friendica
Die Vermutung ist, dass es inkompatibel zu den bisherigen Implementierungen sein wird, auf die sich die Projekte im Fediverse geeinigt haben. In dem Fall muss man erneut die Frage stellen, ob wirklich META umarmt und einverleibt oder ob auch andere Projekte im Fediverse mit Marktmacht nicht umgehen können? Ich sage nur "Don't be Evil".
Questa voce è stata modificata (2 mesi fa)
Unknown parent

@Jupiter Rowland
Am Ende wird interessant sein, wie sie es tatsächlich umgesetzt haben. Hier die von dir angesprochenen FEP
codeberg.org/fediverse/fep/src…
codeberg.org/fediverse/fep/src…

Interessant ist diese Passage:

"Compatibility with other quote implementations

(This section is non-normative.)

While this FEP introduces https://w3id.org/fep/044f#quote, there are competing definitions for the representation of quote posts:

  • _misskey_quote (https://misskey-hub.net/ns/#_misskey_quote)
  • quoteUrl (https://www.w3.org/ns/activitystreams#quoteUrl)
  • quoteUri (http://fedibird.com/ns#quoteUri)
  • FEP-e232 Object links with a https://misskey-hub.net/ns/#_misskey_quote rel value

We believe each of those to have significant drawbacks, such as re-using a namespace that has no definition for them, implying the value is an URL or URI, or using an unusual naming scheme, and none of them are linked to a control mechanism like the one defined in this FEP, hence why we introduced https://w3id.org/fep/044f#quote.

That being said, we suggest some of them as fallback for compatibility with existing fediverse software implementations."



Very good video on how fanatics are born (and how you might be one)youtu.be/gysxm7PfPmM?...

How Ordinary People Learn to H...



"Britain’s mainstream media have not carried out a single investigation into the extent, impact or legal status of the more than 500 surveillance flights over Gaza that the RAF has carried out since December 2023.

The Ministry of Defence continues to insist that the operations, carried out by Shadow R1 aircraft based at RAF Akrotiri in Cyprus, are designed purely to assist with the discovery of Israeli hostages taken by Hamas on 7 October 2023.

It appears that Britain’s obedient defence correspondents have no appetite to challenge this or even to raise the slightest concern about the legal or ethical implications of providing intelligence support to Israel in the middle of a genocide.

Yet thanks to dogged work by campaigners, independent journalists and pro-Palestine MPs, we know both that the flights are continuing to operate (as they did even throughout the ceasefire) and that spikes in the number of flights have coincided with especially deadly Israeli attacks on Gaza.

The lack of curiosity on the part of mainstream media is perhaps not surprising but it is deeply troubling."

declassifieduk.org/uk-media-ar…

#UK #Israel #RAF #Palestine #Gaza #Genocide #HumanRights

reshared this



Poof! not proof: The Epstein Files Vanish
#TheList #Coverup #ACAB

Mutahar - #BlackPill #Blackmail
youtube.com/watch?v=YCCQzXmSk8…
#BlackBall #Fatalism
incels.wiki/w/Blackpill

America This Week 7/7/25
racket.news/p/america-this-wee…
* #Bondi clips at beginning

the message: “spit in the face of the public” and the Elite as court of Versailles
rumble.com/v6vvwba-saagar-enje…
#TCN #Zionism

Questa voce è stata modificata (2 mesi fa)


„Das Lager werde auch dazu dienen, den radikalen Emigrationsplan für die Palästinenser umzusetzen. »Denn der wird kommen«, zitierten Medien, deren Vertreter bei dem Briefing anwesend waren, den Minister.“ www.spiegel.de/ausland/rafa...

Rafah: Israel plant Auffanglag...




Angriffe auf Solaranlagen: Blackout-Gefahr aus China? – Unser MONITOR-Film über ein unterschätztes Risiko. "Ich kann das nicht verhindern" sagt Bundesinnenminister Dobrindt von der CSU dazu. Wirklich nicht?youtu.be/AcfUgfFkan4?...

Solar-Sabotage: Gefahr aus Chi...



Tear down this Splinternet wall!

Sneakernet Stegano-graphy
splintercon.net/relevant-work
en.wikipedia.org/wiki/Steganog…

censorship.no
gitlab.com/equalitie/ouinet/bl…
#Ceno Browser (not updated in 10 months)
could be used in conjunction with BeePass or #Nym for additional privacy in #BT friendly countries
#mobile #censorship #Iran
privacyaccelerator.org

#CodeIsLaw #BT
Internet Governance Bodies would account for the necessity of procuring funding for media production by establishing an arts and culture foundation (like Library of Congress, Mos Film, or Federal Theater WPA) to support the means of access, which is needed when making cultural media a public access community commons via a news protocol (BT = public fast cache)

* creating a collective funding reserve for quality information disseminated by internet protocol for censorship-resistant direct access

* hackers realized that email is the news (without PGP), hence slrn (NetworkNewsTP protocol)
but Iran might go nuclear on the People’s cache like DHS

#middleware – consider that Ceno “privacy mode” erases cache
snowflake.torproject.org/
addons.mozilla.org/en-US/firef…
#snowflake #TorProject

Questa voce è stata modificata (2 mesi fa)


Kurz vorm #Fedicamp2025 ein wenig Lagerfeuerstimmung (auch wenn es dieses Mal mit Brandschutz etwas trübe für das Lagerfeuer aussieht) mit einem unserer "Klassiker" und schrammeliger Smartphone-Aufnahme.

B-Haus Song, ein Konny-Cover

cc: @grindhold 😀

#StimmenImFediverse #Fedicamp



The winners of the annual BigPicture Natural World Photography Competition for 2025 have been announced, and they're stunning: bigpicturecompetition.org/2025…

#photography #nature #NaturePhotography



github.com/HW-whistleblower/Tr… pretty fun leak is happening as we speak.
in reply to Bill

there's a bit of cooked Gen Alpha slang in here, and there is a bunch of Huawei internal slang too. If you read anything about divisions or military speak - that's all Huawei org chart bullshit.
Questa voce è stata modificata (2 mesi fa)
in reply to Jonathan Reiter (张飞)

My simplified Chinese is effectively non-existant, so I have to depend on translators. Looses all of the feel of what they were trying to say. But I get the drift. And we thought we had a intrapersonal mess with AI in the USA!! Sheesh.



In this week's episode of @npc, I did a lot of research for the Nintendo Switch 2 dock and its limitations.

I've found a way to use the Switch 2 with my XR glasses: youtube.com/watch?v=9PiOyQN5R4…

And for Patreon subscribers, even more HDMI experiments and a new Weird Cable: patreon.com/posts/fairy-cables…