Java

004. HTML 파싱하기 - 다음 실시간 이슈

top2blue 2015. 7. 30. 11:34

다음의 실시간 이슈를 가져와보자

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
 
import java.io.IOException;
 
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;
 
// 다음 실시간 이슈 읽기
public class DaumRealTimeSearchWord {
    public static void main(String[] args) {
        Document document = null;
        try {
            String url = "http://www.daum.net/";
            document = Jsoup.connect(url).get();
 
            Elements elements = document.select("#realTimeSearchWord li");
            for(Element e : elements){
                Elements rank = e.select("div");
                System.out.print(rank.get(1).select("span").get(0).text());
                System.out.print(" : ");
                System.out.print(rank.get(1).select("span").get(1).text());
                System.out.print(" => ");
                String em = rank.get(1).select("em").get(0).text();
                // 신규 진입일 경우에는 "상승"항목이 없다.
                String span="";
                if(rank.get(1).select("span").size()>=3)
                     span = rank.get(1).select("span").get(2).text();
                
                System.out.print( span + " [");
                System.out.println( em.substring(span.length()) + "]");
                String link = rank.get(1).select("a").attr("href");
                System.out.println("link : " + link);
                System.out.println();
            }
        } catch (IOException e) {
            e.printStackTrace();
        }
    }
}
 
 
cs

결과

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
1위 : 장진영변호사 => 상승 [13]
link : http://search.daum.net/search?w=tot&q=%EC%9E%A5%EC%A7%84%EC%98%81%EB%B3%80%ED%98%B8%EC%82%AC&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
2위 : 배수정 => 상승 [10]
link : http://search.daum.net/search?w=tot&q=%EB%B0%B0%EC%88%98%EC%A0%95&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
3위 : 미션임파서블5 => 상승 [9]
link : http://search.daum.net/search?w=tot&q=%EB%AF%B8%EC%85%98%EC%9E%84%ED%8C%8C%EC%84%9C%EB%B8%945&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
4위 : 곽정은 => 상승 [6]
link : http://search.daum.net/search?w=tot&q=%EA%B3%BD%EC%A0%95%EC%9D%80&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
5위 : 혁오 표절 논란 =>  [신규진입]
link : http://search.daum.net/search?w=tot&q=%ED%98%81%EC%98%A4+%ED%91%9C%EC%A0%88+%EB%85%BC%EB%9E%80&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
6위 : 노유민 다이어트 비법 => 상승 [4]
link : http://search.daum.net/search?w=tot&q=%EB%85%B8%EC%9C%A0%EB%AF%BC+%EB%8B%A4%EC%9D%B4%EC%96%B4%ED%8A%B8+%EB%B9%84%EB%B2%95&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
7위 : 대안학교 => 상승 [99]
link : http://search.daum.net/search?w=tot&q=%EB%8C%80%EC%95%88%ED%95%99%EA%B5%90&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
8위 : 드론 => 상승 [86]
link : http://search.daum.net/search?w=tot&q=%EB%93%9C%EB%A1%A0&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
9위 : 원빈 아버지 => 상승 [9]
link : http://search.daum.net/search?w=tot&q=%EC%9B%90%EB%B9%88+%EC%95%84%EB%B2%84%EC%A7%80&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
10위 : 최성국 => 상승 [83]
link : http://search.daum.net/search?w=tot&q=%EC%B5%9C%EC%84%B1%EA%B5%AD&rtupcoll=DQP,NNS&guide=rissue&DA=ATG&nil_profile=rtupkwd
 
 
 
 
cs