freeradical.zone is one of the many independent Mastodon servers you can use to participate in the fediverse.
Infosec and privacy and technology and leftward politics and cats and dogs and...

Server stats:

260
active users

#yacy

0 toots0 participants0 toots today

Estoy haciendo un experimento con YaCy, un buscador p2p para indexar internet. Que sitios interesantes para escanear se les ocurren? Sitios que tengan info sin tener que loguearse, como bibilotecas, tutoriales, manuales, enciclopedias, conocimiento, tecnología, cultura, atre, literatura, etc. Comenten que enlaces les parecen importantes asi los voy agregando a la lista de crawl, quiero ver que se puede lograr. Monte un servidor dedicado exclusivamente a esto, a escanear internet, es medio un delirio, pero es tanta la basura que me tiran los disquebuscadores que me parece que me voy a montar el mio propio #yacy #p2p #buscadores #search #engine #internet #undernet

Replied in thread

@domanipagani @filippodb @dansup sono andata a cercare info su #duckduckgo e ho avuto una cattiva sorpresa. Sia quella che #ecosia usano Bing 😖 cioè Microsoft 🤮 . Quindi alla fine contribuiamo ancora con il gigante USA. A quanto pare le uniche open source / libere di rapporti con le grandi aziende / che non raccolgono dati e rispettano la privacy / progettate nella UE , sono #SearNXG e #YaCy . Ma SearNXG è più " prestante " e semplice da usare ( secondo ciò che dicono, ancora non ho provato.

Ooooh. Stumbled upon a post comparing 13 rust crates for extracting text from html (via This Week in Rust), which led me to dom_smoothie, and this is such a great find.

I'm currently using #YaCy as a personal search engine, but wanted to migrate off of it. A large part of that effort is cleaning the HTML for readability, so that I only index the good parts. This looks like a good fit for that purpose!

I made a proof of concept of the indexer & search interface, that part was straightforward. The missing piece is the crawler, and a big part of that is the HTML cleaning.

Looks like that might be solved now. So after my adventure into AI poisoning, I might just end up building the personal search engine I wanted to build.

Evan SchwartzComparing 13 Rust Crates for Extracting Text from HTMLApplications that run documents through LLMs or embedding models need to clean the text before feeding it into the model. I'm building a personalized content...

I mentioned it a few days ago, and indeed decided to host my own instance of the #yacy #p2p #search engine. I set up my own #domain for it and set myself up as a Senior peer. It's kind of hypnotizing watching my instance crawl and index its own little corner of the #web. I might do a blog post. Join in the fun! There are 200+ peers out there and 2B indexed documents. yacy.net/

yacy.netHome - YaCyYaCy P2P - Decentralized Search Engine
Replied to klokanek

@klokanek #Yacy jsem několikrát testoval, během strašně moc let (snad 15ti? ten vývoj všeho strašně zamrzl...)

Druhý pokus byl výrazně použitelnější, než ten první, ale to nic nemění na tom, že to použitelné v podstatě nebylo. Ano, některé výsledky vyhledávání byly nečekané, protože to prolézalo různá zákoutí netu, která se vůbec nedostanou na Google, ale pro každodenní práci to nebylo. A přesně tam jsem řešil, co by mělo být výchozím bodem hypertextového vyhledávání... a byl to pomalej moloch, náročnej na paměť.

Moc jsem nepochopil ten jejich koncept "distributed hash table", ale samozřejmě byl asi pokročilejší, než federovaná cache Mastodonu, podle všeho. Problém se mnou je, že nejsem úplně fanoušek hashování, takže se mi to nechtělo úplně studovat.

Mastodon je po letech mého zájmu o různé decentralizované experimenty, včetně třeba Jabberu, samozřejmě včetně mnoha let trápení s konvenčním e-mailem a DNS systémem, zatím nejzdařilejší open source decentralizovaná platforma, se kterou mám zkušenost. Pokud je budoucnost vyhledávání mimo Google, mělo by to být alespoň takhle dobré...

Replied to Chao-c'

@xChaos jo, myslím, že jsme se o tom už bavili, jediná použitelná decentralizovaná varianta vyhledávače je #yacy. našel jsem i nějakou instanci orientovanou na #mastodon, zdá se, že experimenty zatím vycházejí. bohužel #yacy je psaný v #java a chyběj mu vývojáři, kteří by to posunuli o kus dál. (moloch náročnej na paměť, relevance funguje blbě a velmi laxně fixovaný bugy.) běžících instancí je relativně dost po celým světě.
yacy.net/
yacy.net/material/Description_
yacystats.de/

yacy.netHome - YaCyYaCy P2P - Decentralized Search Engine

**Searching in IPFS with YaCy**

YaCy is a decentralized search engine that can integrate with IPFS (InterPlanetary File System), enabling search across a distributed network. IPFS is a distributed file storage system that does not rely on central servers, instead utilizing a P2P network where data is stored across multiple nodes worldwide. Together, YaCy and IPFS combine decentralized search and storage capabilities, offering a robust solution for privacy-focused data access.

### How YaCy Integrates with IPFS:
1. **Indexing Data from IPFS**: YaCy can be configured to index links to data stored on IPFS, making it possible to find and browse files, documents, and other resources shared by IPFS users.
2. **Accessing Files via Hash Identifiers**: In IPFS, files are accessed using unique hashes (CIDs). YaCy can handle these hashes, allowing you to search for specific IPFS files if you know the hash.
3. **Full Decentralization**: With IPFS integration, YaCy provides search capabilities across a decentralized database, bypassing central servers. This is valuable for users who prioritize privacy, as both IPFS and YaCy enable information access without typical internet censors and centralized controls.

### Benefits of Using YaCy with IPFS:
- **Privacy and Confidentiality**: Users can search and browse data on a decentralized network without sharing personal information.
- **Censorship Resistance**: Files on IPFS are not dependent on individual servers and can remain accessible even if the original uploaders go offline.
- **Data Control**: YaCy users can easily search and manage their IPFS storage, retaining complete control over their data.

### Conclusion
YaCy’s integration with IPFS provides access to decentralized data, enhancing privacy and reliability for searching and accessing information. This solution is especially relevant for users seeking autonomy and wishing to avoid traditional search engines that often track user behavior and preferences.

**Hashtags:**
#YaCy #IPFS #DecentralizedSearch #P2P #Privacy #CensorshipFreeInternet #Integration #OpenSource #InformationFreedom #DigitalAutonomy

fediverse-decentralize.blogspo

fediverse-decentralize.blogspot.com**Searching in IPFS with YaCy**YaCy is a decentralized search engine that can integrate with IPFS (InterPlanetary File System), enabling search across a distributed network. IPFS is
Replied in thread

(2/3)

We really need a decentralised search solution that goes deeper than Searx. How come YaCy never took off?

Question for the fediverse search haters out there;

How would you feel about a fully Free Code web search engine that used the fediverse as a source of links to crawl, to build a freely-licensed search index?

Ça y est !! :blobaww:
Mon nœud #yacy fait maintenant parti du grand réseaux décentralisé !!!
:ablobcool:

J’ai bataillé un peu avec ma box, mais finalement il suffisait d’ouvrir un port réseau.

J’aime beaucoup construire mon propre index à partir de flux RSS, de mes favoris, des partages ici où la !!!

Par contre j’ai encore du mal avec le tri des résultats qui parfois sont bien et parfois juste tellement random que ça en est drôle !!!! :rblobglare: :blobsmilehappyeyes:
#websearch