Java爬取网页指定内容

2022/3/3 11:15:26

编程Tag： 网页爬取 import Java static test jsoup public IOException

本文主要是介绍Java爬取网页指定内容，对大家解决编程问题具有一定的参考价值，需要的程序猿们随着小编来一起学习吧！

爬取网页文字：

import org.jsoup.Jsoup;
import org.junit.Test;

import java.io.IOException;

public class Crawling {


    public static void Test() throws IOException {
        Jsoup.connect("https://soccer.hupu.com/").get().body().
                getElementsByClass("list-item"). //class="list-item-title"
                forEach(e->{
            System.out.println(e.text());
        });

    }

    public static void main(String[] args) {
        try {
            Test();
        } catch (IOException e) {
            e.printStackTrace();
        }
    }

}

爬取网页图片地址：

import org.jsoup.Jsoup;
import org.junit.Test;

import java.io.IOException;

public class Crawling {

    public static void Test() throws IOException {
        Jsoup.connect("https://soccer.hupu.com/").get().body().
                getElementsByClass("list-item-img").
                forEach(e->{
            System.out.println(e.attr("src")); //src标签图片地址
        });
    }

    public static void main(String[] args) {
        try {
            Test();
        } catch (IOException e) {
            e.printStackTrace();
        }
    }

}

这篇关于Java爬取网页指定内容的文章就介绍到这儿，希望我们推荐的文章对大家有所帮助，也希望大家多多支持为之网！

Java爬取网页指定内容

相关编程文章