Java
004. HTML 파싱하기 - 다음 실시간 이슈
top2blue
2015. 7. 30. 11:34
다음의 실시간 이슈를 가져와보자
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 | import java.io.IOException; import org.jsoup.Jsoup; import org.jsoup.nodes.Document; import org.jsoup.nodes.Element; import org.jsoup.select.Elements; // 다음 실시간 이슈 읽기 public class DaumRealTimeSearchWord { public static void main(String[] args) { Document document = null; try { String url = "http://www.daum.net/"; document = Jsoup.connect(url).get(); Elements elements = document.select("#realTimeSearchWord li"); for(Element e : elements){ Elements rank = e.select("div"); System.out.print(rank.get(1).select("span").get(0).text()); System.out.print(" : "); System.out.print(rank.get(1).select("span").get(1).text()); System.out.print(" => "); String em = rank.get(1).select("em").get(0).text(); // 신규 진입일 경우에는 "상승"항목이 없다. String span=""; if(rank.get(1).select("span").size()>=3) span = rank.get(1).select("span").get(2).text(); System.out.print( span + " ["); System.out.println( em.substring(span.length()) + "]"); String link = rank.get(1).select("a").attr("href"); System.out.println("link : " + link); System.out.println(); } } catch (IOException e) { e.printStackTrace(); } } } | cs |
결과
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 | 1위 : 장진영변호사 => 상승 [13] link : http://search.daum.net/search?w=tot&q=%EC%9E%A5%EC%A7%84%EC%98%81%EB%B3%80%ED%98%B8%EC%82%AC&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd 2위 : 배수정 => 상승 [10] link : http://search.daum.net/search?w=tot&q=%EB%B0%B0%EC%88%98%EC%A0%95&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd 3위 : 미션임파서블5 => 상승 [9] link : http://search.daum.net/search?w=tot&q=%EB%AF%B8%EC%85%98%EC%9E%84%ED%8C%8C%EC%84%9C%EB%B8%945&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd 4위 : 곽정은 => 상승 [6] link : http://search.daum.net/search?w=tot&q=%EA%B3%BD%EC%A0%95%EC%9D%80&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd 5위 : 혁오 표절 논란 => [신규진입] link : http://search.daum.net/search?w=tot&q=%ED%98%81%EC%98%A4+%ED%91%9C%EC%A0%88+%EB%85%BC%EB%9E%80&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd 6위 : 노유민 다이어트 비법 => 상승 [4] link : http://search.daum.net/search?w=tot&q=%EB%85%B8%EC%9C%A0%EB%AF%BC+%EB%8B%A4%EC%9D%B4%EC%96%B4%ED%8A%B8+%EB%B9%84%EB%B2%95&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd 7위 : 대안학교 => 상승 [99] link : http://search.daum.net/search?w=tot&q=%EB%8C%80%EC%95%88%ED%95%99%EA%B5%90&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd 8위 : 드론 => 상승 [86] link : http://search.daum.net/search?w=tot&q=%EB%93%9C%EB%A1%A0&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd 9위 : 원빈 아버지 => 상승 [9] link : http://search.daum.net/search?w=tot&q=%EC%9B%90%EB%B9%88+%EC%95%84%EB%B2%84%EC%A7%80&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd 10위 : 최성국 => 상승 [83] link : http://search.daum.net/search?w=tot&q=%EC%B5%9C%EC%84%B1%EA%B5%AD&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd | cs |