This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
{ | |
"partnerName": "archiver", | |
"relatedSources": "null", | |
"fetchTimeStamp": "18-10-2012 22:29:05", | |
"ACE-VERSION": "1.0.5", | |
"ExtractionAlg": "ReadabilitySnack", | |
"actualTimeStamp": "11-05-2011 22:02:29", | |
"wordCount": 40, | |
"Title": "Moody's assigns (P)A3/(P)P-1 to RHB Bank's proposed EMTN programme", | |
"Tags": "News", |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
"event": "<div id=\"story_content\" class=\"clearfix\" algoscore=\"232.0\">\n <p algoscore=\"38.0\"> <a topic_url=\"http://topics.bloomberg.com/cap-gemini-sa/\" href=\"http://www.bloomberg.com/quote/CAP:FP\" density=\"sparse\" title=\"Get Quote\" ticker=\"CAP:FP\" class=\"web_ticker\">Cap Gemini SA</a> and <a topic_url=\"http://topics.bloomberg.com/intel-corp/\" href=\"http://www.bloomberg.com/quote/INTC:US\" density=\"full\" title=\"Get Quote\" ticker=\"INTC:US\" class=\"web_ticker\">Intel Corp</a>.", |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
{ | |
"_index": "algotree", | |
"_type": "news", | |
"_id": "YZQETACPQUOrXleEnPBy2Q", | |
"_version": 9, | |
"exists": true, | |
"_source": { | |
"partnerName": "archiver", | |
"relatedSources": "null", | |
"fetchTimeStamp": "30-09-2012 13:52:20", |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
[2013-03-26 04:52:23,329][DEBUG][action.bulk ] [Arlok] [reloaded][1] failed to execute bulk item (index) index {[reloaded][news][h6cwiFcyRve-nev21h6gwg], source[{"partnerName":"archiver","relatedSources":"null","fetchTimeStamp":"20-10-2012 16:30:57","ACE-VERSION":"1.0.5","ExtractionAlg":"ReadabilitySnack","actualTimeStamp":"10-03-2010 10:17:14","wordCount":181,"Title":"Macquarie upgrades Lockheed, citing re-based jet program","Tags":"News","Link":"http://www.reuters.com/article/2010/03/10/lockheed-upgrade-idUSN1012194820100310","SourceName":"Reuters","ConceptTags":["Macquarie Analyst Rob Stallard","Cost Overruns","Radar-evading Fighter","Program Re-based","News Flow","Macquarie Research","Fighter-jet Program","Weapons Program","Fighter Program","News Reports","Defense Contractor Lockheed Martin Corp","Lockheed Shares","Development Phase","Schedule Delays","Growth Outlook","Morning New York Stock Exchange Trading","Defense Department"],"Content":"<span id=\"articleText\" algoscore=\"107.5\"><span |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
[2013-03-26 04:20:21,116][DEBUG][action.bulk ] [Fatale] [reloaded][0] failed to execute bulk item (index) index {[reloaded][news][UjRLaqaaSvORZLIZU2Yx6A], source[{"partnerName":"archiver","relatedSources":{"relatedSources":[{"Source":"Reuters","Link":null},{"Source":"Reuters","Link":null}]},"fetchTimeStamp":"18-10-2012 02:12:43","ACE-VERSION":"1.0.5","ExtractionAlg":"ReadabilitySnack","actualTimeStamp":"10-06-2010 01:17:00","wordCount":487,"Title":"Citi finds few buyers for unwanted retail cards","Tags":"News","Link":"http://www.moneycontrol.com/news/business/citi-finds-few-buyers-for-unwanted-retail-cards_463343.html","SourceName":"MoneyControl","ConceptTags":["Credit Card Loans","United States","John Grund","Chief Executive Vikram Pandit","Portfolio Brokers","Partner Cards","Citigroup Spokeswoman Shannon Bell","HSBC Holdings PLC","Industry Leader","Brokers Card Portfolio Sales","Higher-than-average Losses","Store Cards","Citigroup Inc","Banco Santander","Banco Santander Spokesman Peter Greiff", |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
root@Dell55:~# curl "localhost:9200/_nodes/stats?pretty=true" | |
{ | |
"cluster_name" : "staging", | |
"nodes" : { | |
"5hgAe0rGQP6NXqIdhg-eHA" : { | |
"timestamp" : 1364225317781, | |
"name" : "Danielle Moonstar", | |
"transport_address" : "inet[/192.168.1.246:9300]", | |
"hostname" : "dell75", | |
"indices" : { |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
dell56@Zeppelin:/var/log/elasticsearch$ l | |
dell56@Zeppelin:/var/log/elasticsearch$ curl -XGET 'http://localhost:9200/_nodes/?settings=true&pretty=true' | |
{ | |
"ok" : true, | |
"cluster_name" : "staging", | |
"nodes" : { | |
"E0S9rZK1TAus-KWvemchyA" : { | |
"name" : "Photon", | |
"transport_address" : "inet[/192.168.1.247:9300]", | |
"hostname" : "dell57", |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
curl -X PUT "http://localhost:9200/reloaded" -d '{ | |
"index" : { | |
"number_of_shards" : 4, | |
"number_of_replicas" : 1 , | |
"analysis":{ | |
"analyzer":{ | |
"content" : { | |
"type" : "custom", | |
"tokenizer" : "standard", | |
"filter" : ["lowercase" , "stop" , "kstem"], |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
public void map(int docID, SearchLookup context) { | |
if(context.doc().containsKey("EMAC-VERSION")){ | |
String emacVersion = ((StringDocFieldData) context.doc().get("EMAC-VERSION")).getValue(); | |
if(emacVersion != null && emacVersion.equals("2.0.0")){ | |
return; | |
} | |
} | |
String id = org.elasticsearch.index.mapper.Uid.idFromUid( ((StringDocFieldData)context.doc().get("_uid")).stringValue()); | |
try{ | |
if(!context.source().containsKey("Content")){ |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
dell56@Zeppelin:/media/backup/ElasticSearch_bak/esbackMar21/algotree$ du -sch * | |
7.5G 0 | |
8.0G 1 | |
4.0K _state | |
16G total | |
dell56@Zeppelin:/media/backup/ElasticSearch_bak/esbackMar21/algotree$ ls -lh 0/index/ | |
total 7.5G | |
-rwxr-xr-x 0 root root 16M Mar 21 10:55 _10xh8.fdt | |
-rwxr-xr-x 0 root root 28K Mar 21 10:50 _10xh8.fdx | |
-rwxr-xr-x 0 root root 4.8K Mar 21 10:50 _10xh8.fnm |