Created
November 12, 2012 06:13
-
-
Save kimukou/4057789 to your computer and use it in GitHub Desktop.
jsoup_test
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
// ref http://d.hatena.ne.jp/maji-KY/20110919/1316417705 | |
// http://d.hatena.ne.jp/t-horikiri/20120308/1331182734 | |
// | |
// javadoc http://jsoup.org/apidocs/index.html?overview-summary.html | |
// | |
@Grab(group='org.jsoup', module='jsoup', version='1.7.1') | |
import org.jsoup.Jsoup | |
import org.jsoup.nodes.Document | |
import org.jsoup.nodes.Element | |
import org.jsoup.select.Elements | |
long start = System.nanoTime() | |
//url = "http://www.nicovideo.jp/ranking/fav/daily/imas" | |
url="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&sv=1&ei=UTF-8&md=t" | |
Document document = Jsoup.connect(url).get(); | |
//surl="span[style=color:#C00;]" | |
//surl="div[class=cnt cf]" | |
surl="div[class=cnt cf]" | |
Elements spans = document.select(surl) | |
println(spans.size()) | |
println("-----------------------------------------------------------------") | |
for(Element span : spans) { | |
String datetime = span.select("div").first().attr("data-time"); | |
println datetime | |
String message = span.select("h2").first() | |
println message | |
//Element div = span.parent().parent(); | |
//System.out.println(div.html()); | |
println("-----------------------------------------------------------------") | |
} | |
long time = System.nanoTime() - start | |
System.err.println("execute: "+time/1000000d+"msec.") |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
10 | |
----------------------------------------------------------------- | |
1352700467 | |
<h2>【送料無料】コスパ の 這いよれ ニャル子さん 宇宙からのストレートタンブラー【 <a target="_blank" class="url" href="http://t.co/JXwsTkYN">amzn.to/KiGkPF</a> 】 <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23Amazon&ei=UTF-8&rkf=1">#Amazon</a> <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a> <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23goen&ei=UTF-8&rkf=1">#goen</a></h2> | |
----------------------------------------------------------------- | |
1352700033 | |
<h2>(」・ω・)」うー!(/・ω・)/にゃー <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a></h2> | |
----------------------------------------------------------------- | |
1352699553 | |
<h2>【自動】八坂手うが 手うがと呼んでかまわない <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23%E6%89%8B%E3%81%86%E3%81%8C&ei=UTF-8&rkf=1">#手うが</a> <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a></h2> | |
----------------------------------------------------------------- | |
1352699261 | |
<h2>【自動】何が手うが(照)ですか、あざとい女ですね <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23%E6%89%8B%E3%81%86%E3%81%8C&ei=UTF-8&rkf=1">#手うが</a> <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a></h2> | |
----------------------------------------------------------------- | |
1352698980 | |
<h2><a target="_blank" href="http://twitter.com/Salvere459">@Salvere459</a> お帰りなさいませうっちーさん。私にしますぅ?私がいいですか?・・・・それとも、和・菓・子? <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a> <a target="_blank" class="url" href="http://t.co/ieLOGHuq">twitpic.com/a7333s</a></h2> | |
----------------------------------------------------------------- | |
1352698978 | |
<h2><a target="_blank" href="http://twitter.com/ichuki_A251">@ichuki_A251</a> はぁ~い!!いつもニコニコあなたの隣に這い寄る混沌ニャルラトホテプ、です♡ <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a> <a target="_blank" class="url" href="http://t.co/Hi9D4Tgb">twitpic.com/a351by</a></h2> | |
----------------------------------------------------------------- | |
1352698977 | |
<h2><a target="_blank" href="http://twitter.com/XzangetuX">@XzangetuX</a> ふーふーしてあげましょうか? <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a> <a target="_blank" class="url" href="http://t.co/fumRzaLB">twitpic.com/a3d450</a></h2> | |
----------------------------------------------------------------- | |
1352698886 | |
<h2><a target="_blank" href="http://twitter.com/maruwo_">@maruwo_</a> ん〜じゅわぁ〜♡ <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a> <a target="_blank" class="url" href="http://t.co/Q68n7ic7">twitpic.com/a34aqe</a></h2> | |
----------------------------------------------------------------- | |
1352698885 | |
<h2><a target="_blank" href="http://twitter.com/XzangetuX">@XzangetuX</a> ふーふーしてあげましょうか? <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a> <a target="_blank" class="url" href="http://t.co/jkr5yqeb">twitpic.com/a3d450</a></h2> | |
----------------------------------------------------------------- | |
1352698884 | |
<h2><a target="_blank" href="http://twitter.com/XzangetuX">@XzangetuX</a> お帰りなさいませ*暇人*@ 終焉の天使さん。私にしますぅ?私がいいですか?・・・・それとも、和・菓・子? <a class="url" href="http://realtime.search.yahoo.co.jp/search?p=%23nyaruko&ei=UTF-8&rkf=1"><em>#nyaruko</em></a> <a target="_blank" class="url" href="http://t.co/bzZErK6a">twitpic.com/a7333s</a></h2> | |
----------------------------------------------------------------- |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment