26 http 16 org 14 apache 14 Solr 12 wiki 12 solr 11 Lucene 8 with 8 Hive 7 search 7 html 7 com 7 UIMA 7 Mahout 6 www 6 uima 6 Nutch 6 Hadoop 5 via 5 nutch 5 check 5 allow 4 result 4 quora 4 proxy 4 php 4 list 4 links 4 from 4 do 4 consider 4 be 4 add 4 PersonalInformationStream 4 Apache 4 Amazon 3 usage 3 through 3 sandbox 3 pmWiki 3 particular 3 mod 3 lucene 3 localhost 3 interface 3 have 3 forward 3 find 3 engine 3 data 3 crawl 3 cf 3 as 3 an 3 also 3 Web 3 Tomcat 3 Services 3 Server 3 Ruby 2 when 2 webserver 2 very 2 system 2 source 2 see 2 revision 2 provides 2 previous 2 port 2 page 2 other 2 only 2 note 2 no 2 names 2 local 2 load 2 lists 2 largest 2 if 2 htdocs 2 files 2 file 2 explored 2 explore 2 error 2 distributed 2 depth 2 default 2 bat 2 already 2 access 2 WithoutNotesSeptember 2 Wikipedia 2 SolrTomcat 2 Seedea 2 Resources 2 Rails 2 October 2 Manning 2 Lucid 2 Local 2 Lighttpd 2 Intelligence 2 Imagination 2 HTTPServer 2 HTTP 2 EC 2 Deny 2 Collective 2 Allow 2 AWS 1 you 1 yet 1 xampp 1 would 1 world 1 will 1 wikis 1 while 1 which 1 where 1 websolr 1 way 1 waste 1 warehouse 1 vs 1 using 1 user 1 useful 1 urls 1 upgrade 1 update 1 unmaintainable 1 tutorial 1 trying 1 tried 1 trackgc 1 tr 1 topicmarks 1 topN 1 tomcat 1 time 1 thus 1 threads 1 though 1 them 1 test 1 taken 1 tail 1 table 1 synchronize 1 supports 1 summary 1 stream 1 straight 1 spatial 1 spam 1 sources 1 solution 1 sites 1 simple 1 should 1 services 1 select 1 seems 1 seeks 1 running 1 rubyforge 1 resources 1 repository 1 registered 1 quickly 1 query 1 proposal 1 properly 1 projects 1 project 1 previously 1 present 1 prepare 1 prefix 1 potential 1 popularity 1 pointing 1 plenty 1 place 1 pipes 1 opencalais 1 now 1 newly 1 name 1 my 1 msg 1 most 1 more 1 mirrors 1 meta 1 mention 1 max 1 manage 1 make 1 major 1 mail 1 lucas 1 lti 1 look 1 long 1 location 1 locally 1 limit 1 like 1 lexical 1 level 1 learning 1 large 1 key 1 keep 1 jetty 1 into 1 instead 1 installed 1 installation 1 indexed 1 index 1 included 1 id 1 htm 1 having 1 has 1 handshakes 1 hadoop 1 grub 1 gossamer 1 get 1 geolocalization 1 geolocalisation 1 generator 1 generate 1 general 1 freenode 1 formatted 1 format 1 following 1 focus 1 feature 1 extend 1 explicit 1 existing 1 examples 1 even 1 equivalent 1 entity 1 encounting 1 en 1 edu 1 easy 1 downloads 1 domain 1 doc 1 distribution 1 display 1 dir 1 detection 1 date 1 cygdrive 1 cs 1 crawled 1 correct 1 content 1 consumer 1 connection 1 connect 1 conjunction 1 configure 1 components 1 complex 1 compare 1 coherent 1 code 1 cmu 1 clusters 1 cluster 1 center 1 categories 1 bots 1 blogs 1 bin 1 become 1 based 1 bandwidth 1 aware 1 avoid 1 available 1 articles 1 are 1 archive 1 app 1 annotators 1 annotator 1 annotation 1 analyzer 1 almost 1 allowconnect 1 all 1 advantages 1 admin 1 adding 1 actually 1 acts 1 account 1 WithoutNotesFebruary 1 Will 1 Wiki 1 WhitespaceAnalyzer 1 What 1 URLRewrite 1 Toby 1 Text 1 Sylvain 1 Style 1 StandardAnalyzer 1 Spatial 1 SolrcasUserGuide 1 Solrcas 1 Smiley 1 Single 1 Segaran 1 Search 1 Scratch 1 Satnam 1 SSL 1 RunningNutchAndSolr 1 Running 1 Reilly 1 Reduce 1 Rappoport 1 Pugh 1 Publishing 1 ProxyVia 1 ProxyRequests 1 Programming 1 Processing 1 Probably 1 Previously 1 Presentation 1 PoweredBy 1 PmWiki 1 Pig 1 Person 1 PageFileFormat 1 Packt 1 PDF 1 Order 1 OpenLayersAPI 1 OpenCalais 1 NutchGuideForDummies 1 Note 1 Natural 1 Minutes 1 Mike 1 Media 1 Marmanis 1 March 1 MapReduce 1 MTurk 1 Location 1 Lineland 1 Lars 1 Language 1 Java 1 JFlex 1 Intelligent 1 Integration 1 Integrating 1 Ingersoll 1 Indexing 1 Ignite 1 Heroku 1 Haralambos 1 HTML 1 HDFS 1 Grant 1 George 1 Finished 1 Files 1 Facebook 1 FAQ 1 ExtractingRequestHandler 1 Eric 1 ElasticMap 1 Elastic 1 Downloads 1 Dmitry 1 Diego 1 Dhruba 1 Developer 1 Demo 1 David 1 Crontab 1 Cookbook 1 ContentStream 1 Content 1 Community 1 Carrion 1 BuildingWatson 1 Brevoort 1 Borthakur 1 Blog 1 Babenko 1 Avi 1 Algorithms 1 Alexa 1 Alag 1 ActsAsSolrReloaded 1 Action 1 API