YaCy

















































YaCy
YaCy logo.png
Yacy-buscador.png
Original author(s) Michael Christen
Developer(s) YaCy community
Stable release
1.92
/ 26 December 2016; 2 years ago (2016-12-26)

Repository github.com/yacy/yacy_search_server
Written in Java
Operating system Cross-platform
Type
Overlay network, Search engine
License
GPLv2+
Website www.yacy.net/en

YaCy (pronounced "ya see") is a free distributed search engine, built on principles of peer-to-peer (P2P) networks.[1][2] Its core is a computer program written in Java distributed on several hundred computers, as of September 2006[update], so-called YaCy-peers. Each YaCy-peer independently crawls through the Internet, analyzes and indexes found web pages, and stores indexing results in a common database (so called index) which is shared with other YaCy-peers using principles of P2P networks. It is a free search engine that everyone can use to build a search portal for their intranet and to help search the public internet clearly.


Compared to semi-distributed search engines, the YaCy-network has a decentralised architecture. All YaCy-peers are equal and no central server exists. It can be run either in a crawling mode or as a local proxy server, indexing web pages visited by the person running YaCy on his or her computer. (Several mechanisms are provided to protect the user's privacy). Access to the search functions is made by a locally running web server which provides a search box to enter search terms, and returns search results in a similar format to other popular search engines.


YaCy is available on Windows, Mac and Linux. YaCy was created in 2003 by Michael Christen[3].




Contents






  • 1 System components


  • 2 Search-engine technology


  • 3 YaCy platform architecture


  • 4 See also


  • 5 References


  • 6 External links





System components


YaCy search engine is based on four elements:[4]



Crawler

A search robot that traverses from web page to web page and analyzes their content.

Indexer

Creates a reverse word index (RWI) i.e. each word from the RWI has its list of relevant URLs and ranking information. Words are saved in form of word hashes.

Search and administration interface

Made as a web interface provided by a local HTTP servlet with servlet engine.

Data storage

Used to store the reverse word index database utilizing a distributed hash table.




Homepage of YaCy



Search-engine technology




YaCy network



  • YaCy is a complete search appliance with user interface, index, administration and monitoring.

  • YaCy harvests web pages with a web crawler. Documents are then parsed, indexed and the search index is stored locally. If your peer is part a peer network, then your local search index is also merged into the shared index for that network.

  • A search is started then the local index contributes together with a global search index from peers in the YaCy search network.



YaCy platform architecture




Web search showing results of the different components YaCy uses


YaCy uses a combination of techniques for the networking, administration, and maintenance of indexing the search engine including blacklisting, moderation, and communication with the community. Here is how YaCy performs these operations:



  • Community components

    1. Web forum[5]

    2. Statistics

    3. XML API



  • Maintenance

    1. Web Server

    2. Indexing

    3. Crawler with Balancer

    4. Peer-to-Peer Server Communication



  • Content organization

    1. Blacklisting and Filtering

    2. Search interface

    3. Bookmarks

    4. Monitoring search results





See also




  • Dooble – an open-source web browser with an integrated YaCy Search Engine Tool Widget


References





  1. ^ "YaCy takes on Google with open source search engine". The Register. 2011-11-29. Retrieved 2012-04-16..mw-parser-output cite.citation{font-style:inherit}.mw-parser-output .citation q{quotes:"""""""'""'"}.mw-parser-output .citation .cs1-lock-free a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/6/65/Lock-green.svg/9px-Lock-green.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .citation .cs1-lock-limited a,.mw-parser-output .citation .cs1-lock-registration a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/d/d6/Lock-gray-alt-2.svg/9px-Lock-gray-alt-2.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .citation .cs1-lock-subscription a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/a/aa/Lock-red-alt-2.svg/9px-Lock-red-alt-2.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration{color:#555}.mw-parser-output .cs1-subscription span,.mw-parser-output .cs1-registration span{border-bottom:1px dotted;cursor:help}.mw-parser-output .cs1-ws-icon a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/4/4c/Wikisource-logo.svg/12px-Wikisource-logo.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output code.cs1-code{color:inherit;background:inherit;border:inherit;padding:inherit}.mw-parser-output .cs1-hidden-error{display:none;font-size:100%}.mw-parser-output .cs1-visible-error{font-size:100%}.mw-parser-output .cs1-maint{display:none;color:#33aa33;margin-left:0.3em}.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration,.mw-parser-output .cs1-format{font-size:95%}.mw-parser-output .cs1-kern-left,.mw-parser-output .cs1-kern-wl-left{padding-left:0.2em}.mw-parser-output .cs1-kern-right,.mw-parser-output .cs1-kern-wl-right{padding-right:0.2em}


  2. ^ "YaCy: It's About Freedom, Not Beating Google". PC World. 2011-12-03. Retrieved 2012-04-16.


  3. ^ "Ich entwickle eine P2P-basierende Suchmaschine. Wer macht mit?". Heise Online (in German). 2003-12-15. Retrieved 2018-05-09.


  4. ^ "YaCy Technology Architecture". YaCy.net. Retrieved 2012-02-14.


  5. ^ "forum.yacy.de". Retrieved 6 June 2017.








External links


  • Official website








Popular posts from this blog

Accessing regular linux commands in Huawei's Dopra Linux

Can't connect RFCOMM socket: Host is down

Kernel panic - not syncing: Fatal Exception in Interrupt