AFAIK, #Brave search uses its own crawler and index, as does #Gigablast. And let us not forget #YaCy. I intend to host a YaCy peer-to-peer search instance on its own VPS (well, shared with a #Searx instance, which will use it)
@lxo #Searx is a metasearching front end. If I want it to give the kind of results I want, I need to select good back end engines for it to use.
For example, if it uses #Bing, the results it receives from Bing will be bad ... Searx can mix better results from elsewhere to improve what it gives to its users, but why not just remove low quality searches entirely?
No idea about their filters, but I consider #Bing a failure at search. Their results are barely better than #Ask.com these days. Both #Yahoo and #DDG use their backend, but somehow have better results (but both are in steep decline).
I do admit that it may vary based on the topics one searches for, but I've been trying out #Google alternatives for years, but no matter which I use, I eventually have to use Google (even if indirectly, such as via #StartPage or #Searx).
>Is #Qwant really that pervasive? Yes, I'd wish I had backed up the interview with Jean Baptiste Piacentino (ceo) had a few years ago with someone who had knowledge in internet anonymity. Anyway the JBP (ceo) is no trust worthy for example he constantly ignored the questions about metadata. And behind the financing of qwant there is the EU https://www.eurofound.europa.eu/observatories/emcc/erm/factsheets/qwant .
>The deep secret among search providers is that the "B" in #Bing stands for "broken". From what I heard it's great for searching porno-graphical content.
>#Seeks & #Searx can get decent results when they're set to use Google as their provider, but if an instance gets popular, it will hit its API limit. Sadly. At least you can use DDG via searx.
>I really want to host another #YaCy node Yacy is neat but needs a big team for it to be viable with time. Something to finance.
@mangeurdenuage Is #Qwant really that pervasive? #Google got to be so big because its results were better than existing search providers. Yahoo used to be close (even with a Bing backend) and #DuckDuckGo was even closer (with Bing, #Yandex, and others), but lately, using a non-G search results in just going back to G.
.
The deep secret among search providers is that the "B" in #Bing stands for "broken". Despite 20+ years of Microsoft pouring money into search, its results are usually comparable in their awfulness with #Ask.
.
#Seeks & #Searx can get decent results when they're set to use Google as their provider, but if an instance gets popular, it will hit its API limit.
@witchescauldron > searx is a way of adding general web search access to this wider network.
So ... what exactly is it you want to do?
a) add something to #Searx so it searches the fediverse (just hashtags or total keyword search of all posts?) OR b) add general web search to fediverse instances OR c) both OR d) something else
@witchescauldron Scripting plugins for #Searx would be something new. If Searx could include #hash2pub relays in their meta-search sources, that would allow Searx instances to search the entire #fediverse, not just Mastodon and PT.
@info_activism for now, maybe. Until they get acquired. The problem with Goggle is its a completely centralized system controlled by a single corporate entity. Switching to a different centralized system, also controlled by a single corporate entity, just moves the problem sideways. The solution lies in some hybrid of free code search projects #Searx and #YaCy.
@xj9 I don't see the context, but I like #YaCy's peer-to-peer search, despite their search algorithm not being anywhere close to #Google in the quality of results (probably close to #Bing, though). I have hosted YaCy instances in the past, in part to help improve the peer network's results by crawling and indexing sites related to databases, Java, Tcl, Python, PHP (the things I was searching for most often at the time).
I intend to host YaCy again (perhaps feeding a #Searx instance, so its results would not be wholly dependent on the goodwill of big corporate search engines).
@mmin got a link about their use of #Bing? That's not necessarily a problem. #Searx instances send their search queries to corporate search engines too. The question is, does Ecosia send searches in batches with no user metada etc, as Searx does?
I wonder if #Searx could be used to create a meta-meta-search portal, which randomly picks 3-5 other Searx instances to send search queries to, and uses the usual methods it applies to results from the big search engines (pruning duplicates etc)?
@bob I technically have a FB account. The damn things are indestructible ;) Huge numbers of those accounts may be similarly abandoned, or bots, or people who only use it for working because they have to etc. Are your numbers for Google, people using their search, using any their user-facing services, or anyone who interacts with all their spy infrastructure? Again, I technically use Google via #Searx searches, and sometimes get linked to text saved on GoogleDocs instead of #Etherpad etc.
@ajroach42 I don't think there is a good and concise write-up about #Searx. You're probably best served by trying out a few Searx instances and reading the short description on https://github.com/asciimoo/searx
@agnelvishal you can search #DDG with #Javascript turned off, which would satisfy #Stallman and I don't think he classifies search engines as #SaaSS (Service-as-a-Software-Substitute). I do use a #Searx instance more often now though, and thanks for the link, I've been curious about #YaCy for a while. Got a link to the source code for that site?