Filter

My recent searches
Filter by:
Budget
to
to
to
Type
Skills
Languages
    Job State
    275 nutch jobs found, pricing in USD

    Install and cutomize Nutch I would like to see if you have installed this previously Show how to modify visual settings Show how to insert and modify the list of sites I want crawled.

    N/A
    N/A
    0 bids

    ...already available scripts, like Nutch, DataparkSearch, or other script you can recommend. There should be a place where I can specify the Websites, Webpages, and amount of levels down that a Website/Website category should be crawled down. The search engine should be able to hold at least 10,000 sites and more than 1 million pages. The more the better, but I also would like something simple to manage and not a complicated script. I will most likely choose the person/team that can setup a demo in their servers first, so I can try it and see it working properly. There should be a place where I can modify the template and display setting style for the search engine. If searches are extractable using rss/xml, that would be a plus. An existing implementation of Nutch can ...

    N/A
    N/A
    0 bids

    ...of the JAVA Nutch Open Source Search Engine I need a custom version of the open source search engine Nutch. The modifications require addition of a custom metadata keyword field to the Nutch index so that Nutch queries can be searched against this field. The keyword field needs to contain term-specific relevancy (boost) values that can be utilised by a customised version of the Nutch query module. These page-specific keyword and boost values pairs are obtained from a Keyword Database and are injected into the Nutch index during the indexing process. Most of the required functionality already exists in patches to Nutch and only requires incorporation and tweaking of the Nutch code (see background). REQUIRED FEATURES Page Fetchin...

    N/A
    Featured
    N/A
    0 bids

    ...Version of the Nutch Open Source Search Engine We need a custom version of the open source search engine Nutch. The modifications require addition of a custom metadata keyword field to the Nutch index so that Nutch queries can be searched against this field. The keyword field needs to contain term-specific relevancy (boost) values that can be utilised by a customised version of the Nutch query module. These page-specific keyword and boost values pairs are obtained from a Keyword Database and are injected into the Nutch index during the indexing process. Most of the required functionality already exists in patches to Nutch and only requires incorporation and tweaking of the Nutch code (see background). REQUIRED FEATURES Page Fet...

    N/A
    Featured
    N/A
    0 bids

    ...platforms and I think that Nutch is what I want to use. I don't, however, know how to work with Java based applications. You are bidding on installation of Nutch () on a webserver. Depending on your advice this will either be a VPS or a dedicated server. You must also provide me with some basic instructions on modifying the Nutch templates to reflect my search engine and adding advertisements (google, yahoo, etc.). While Nutch is a java application, php must be on the server for future addons I will code myself. I also need basic instructions on how to index the different websites and where to point the crontab in order for it to index nightly. I'm essentially just asking for an installation and consultation. If you've worked with Nutch ...

    $90 (Avg Bid)
    $90 Avg Bid
    3 bids

    I need an example of how to index a nutch crawl using Dotlucene. Please provide the C# source code used to build the index and a simple search demo utilising the index. ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables): a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment. b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will i...

    $82 (Avg Bid)
    $82 Avg Bid
    2 bids

    I'm looking for a complete port of the Java open source search engine Nutch (version 0.7.2) to C#. This project has three distinct core areas: * Web Crawler ??" The Web Crawler must be capable of fetching millions of pages per day. * Indexer - Nutch uses the Lucene API (version 1.9.1) for its indexing. This is also written in Java so will require porting too. * Searcher ??" Please provide an online search application written in ASP.NET utilising the port & index created by the first 2 stages (DotLucene can probably be utilised here..) All code should run under Mono within a Linux platform and should include all functionality contained within the Nutch/Lucene versions stated. ## Deliverables 1) Complete and fully-functional working program(s...

    $8500 (Avg Bid)
    $8500 Avg Bid
    1 bids

    I need someone to install the scrit Nutch for me on my Linux server with Msql database. You will need to install: J2SE 1.4.2 Tomcat 4.1 Then follow the tutorial and do sections: "Whole-web Crawling" "Whole-web: Boostrapping the Web Database" "Whole-web: Fetching" "Whole-web: Indexing" "Searching" Test the sample database that you have indexed and make sure everthing is okay. I will have more work for you to do on this project later, but just quote on the above. I also need it done quickly so if you are really booked up please do not bid. Look at the Tutorial it is self explanitory. PM me for any details you may require. Payment will be put in

    N/A
    N/A
    0 bids

    If you have Nutch and/or Lucene experience - You're a Candidate - If you've written production quality or open source software - You're a Candidate. You will be ask to use open source search engine "Nutch" to build for us a search engine with the following features: - Admin for advertising ads as like in google in the side of search results - Database-interface for import of new URL's and / or advertisings - Keyword-array for indexing through other search engines You should be able to build and test the search engine on a Debian Linux rootserver where you will get full access. If you are interested please email to XXXXXXXXXXXXXXXXXXXXXX Thank you.

    $242 (Avg Bid)
    $242 Avg Bid
    7 bids

    ...themes Most likely, backend should be done in Java, & front end php or Java Frontend to have multi-threaded solution Request for search data to be sent to multiple servers simulataneously - we start with one server, but multi-thread solution to be implemented for future growth Pages for most common queries to be cached Search Engine module as start-point of configuration is Lucene and/or Nutch: _ _It's open source project so there shouldn't be license issues ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables): a) For web sites or other server-side delivera...

    $1870 (Avg Bid)
    $1870 Avg Bid
    1 bids

    If you have Nutch and/or Lucene experience - You're a Candidate - If you've written production quality or open source software - You're a Candidate. You will be ask to use open source as Nutch to buid for us search engine with the following features: Admin for advertising ads as like in google in the side of search results

    $239 (Avg Bid)
    $239 Avg Bid
    7 bids

    Hello. I'm looking for experienced java programmers. Any experience with Nutch and Lucene or search technology a huge asset. Project is a vertical search application. - install, configure appropriate server setup - install, configure lucene / nutch and other chosen plugins - crawl pre-defined sites - eliminate duplicates - add clustering plugin - etc. Please respond if interested and qualified. More details available later on. thanks kidnly for your time. ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables): a) For web sites or other server-side deliverables intended to onl...

    $1190 (Avg Bid)
    $1190 Avg Bid
    3 bids

    Nutch installed on my server.

    $60 (Avg Bid)
    $60 Avg Bid
    1 bids

    I would like Nutch installed. I would like someone who has done this before ans can show me a url if possible of the installation. I may also need customization of nutch after it is installed. Thanks

    N/A
    Featured
    N/A
    0 bids

    We need someone to install Java and Tomcat on a dedicated server. We also need someone to install the open source search engine Nutch on the dedicated server as well and get it ready for us to use. Should be about 2 hour job for someone with dedicated server experience.

    N/A
    N/A
    0 bids

    Our aim is the development of a chinese search engine to rival As a result we seek someone to take the nutch software located at () and develop a search engine around it for us. It would also require development of a simple webpage to search and review results. thanks Any programmer who wishes to download the script for review can find the latest release at () ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables): a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's

    $15625 (Avg Bid)
    $15625 Avg Bid
    3 bids

    We need a technology editor who can write articles about our new search engine which is based on open source technology Nutch. Candidate must be familiar with Nutch structure or quick learner and decribe in article very well. Candidate must also be familiar with technology sites and resources where the article can be posted for maximum exposure of our search engine. Previous experience with technology articles or tutorials is a big plus and more value will be given to people who has advertising experience . If every things goes well we may offer more projects to the the selected coder . For more information about our search engine visit <> And for the search technology we are using visit ## Deliverables 1) Complete

    $11 (Avg Bid)
    $11 Avg Bid
    6 bids

    Hello all, I need of a distributed web crawler + indexing, that can take care of crawls of any size. For example the crawler must be able to crawl & indexing a single website (few web pages) as well as the whole web (over a billion web pages). Installation & configuration : Apache Nutch Thank you

    $176 (Avg Bid)
    $176 Avg Bid
    2 bids

    Boas! Preciso de um ISO para colocar numa máquina virtual com o UBUNTU como Sistema Operativo e tendo o NUTCH instalado e pronto a funcionar com ambiente gráfico.

    $15 / hr (Avg Bid)
    $15 / hr Avg Bid
    5 bids

    Se necesita automatizar la indexación de nutch en solr dentro de una colección ya existente. Dentro de los portales WEB a indexar esta wikipedia la cual se hace de manera diferente a los demás sitios. Todo montado sobre Ubuntu con solr-4.10.1y nutch-1.12. Puede proponer otra manera de hacerlo siempre y cuando se logre automatizar el proceso y realizar consultas desde otro servidor

    $10 - $30
    $10 - $30
    0 bids

    具体需求: 1.在指定服务器安装nutch和抓取我方会提供的20网址。 2.提供nutch具体安装步骤和使用说明 3.抓取内容可导入mysql或solr等,用于查询 4.提供如何查询抓取内容的说明 交付需求: 希望在3天内完成安装和抓取。 (补充说明,我方可以提供服务器供测试使用)

    $33 - $279
    $33 - $279
    0 bids

    Ayudarme a instalar nutch con una base de datos que pueda indexar archivos de topo tipo pdf,xml,doc,etc. y extracción de documentos

    $26 (Avg Bid)
    $26 Avg Bid
    1 bids

    He desarrollado un prototipo web que incluye nutch+solr+wordpress. Wordpress ya responde a consultas contra solr y devuelve resultados en forma de página web a través del plugin "Apache Solr search by WPSOLR". Lo que necesito es concretamente un especialista que habilite la posibilidad de que éste plugin realice consultas contra solr filtrando los resultados por categorias así como que dichas categorias aparezcan en el frontend del buscador. De algún modo también seria interesante la propuesta de que desde nutch (que es el sistema que utilizamos para rastrear las páginas) se pueda definir en el archivo la categoria a la que pertenezca la url a rastrear. Toda la parte de solr, nutch, tomcat, etc. ya est&aacu...

    $174 (Avg Bid)
    NDA
    $174 Avg Bid
    8 bids

    ...Środowisko działania: wyszukiwarka będzie funkcjonował w środowisku UNIX; podczas projektowania od razu należy nastawić się, że skrypt będzie dość mocno obciążony. Front End będzie stał za loadbalancerem + klaster WWW, oddzielny klaster dla bazy danych, nielimitowaną ilość samodzielnych nodów pod Crawler 4. Możliwość wykorzystania istniejących komponentów dostępnych na wolnych licencjach crawlerów (Nutch/Lucene/Crawler4j/YaCy/inne), frameworków lub template engine'ów (musimy zostać o takim fakcie poinformowani) III. Opis funkcjonalności: 1. Front End: 1.1 Założenia: - Wszystkie linki mają być SEO Friendly kluczem jest, aby całość była perfekcyjnie indeksowana przez Google (jedno z najistotniejszych założeń) - Ma...

    min $2
    min $2
    0 bids