jsoup is a Java library for working with real-world HTML.

jsoup: Java HTML Parser

jsoup is a Java library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jquery-like methods.

jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do.

scrape and parse HTML from a URL, file, or string

find and extract data, using DOM traversal or CSS selectors

manipulate the HTML elements, attributes, and text

clean user-submitted content against a safe white-list, to prevent XSS attacks

output tidy HTML

jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree.

Example

Fetch the Wikipedia homepage, parse it to a DOM, and select the headlines from the In the news section into a list of Elements (online sample):

Document doc = Jsoup.connect("http://en.wikipedia.org/").get();

Elements newsHeadlines = doc.select("#mp-itn b a");

Open source

jsoup is an open source project distributed under the liberal MIT license. The source code is available at GitHub.

Getting started

Download the jsoup jar (version 1.7.3)

Read the cookbook introduction

Enjoy!

Development and support

If you have any questions on how to use jsoup, or have ideas for future development, please get in touch via the mailing list.

If you find any issues, please file a bug after checking for duplicates.

Status

jsoup is in general release.

Cookbook contents

مواضيع متعلقة

حل مشكلة خطا عند بدء الاباتشي unknown(): ubable to load dynamic library pgp\extensions\php_iconv.dll

حل مشكلة خطا عند بدء الاباتشي unknown(): ubable to load dynamic library pgp\extensions\php_iconv.dll عند بدء تشغيل برنامج الاباتشي قد يظهر الخطأ التالى unknown(): ubable to load dynamic ..
ASP.net localization Hello, localized world!

.addthis_default_style .at300b,.addthis_default_style .at300bo,.addthis_default_style .at300m{padding:0 2px;}.addthis_default_style .addthis_separator,.addthis_default_style .at300b,.addthis_default_style .at300bo,.addthis_default_style .at300m,.addthis_default_style ..
Java Tip 10: Implement callback routines in Java

Java Tip 10: Implement callback routines in Java Using interfaces to implement the equivalent<BR> of callback functions in Java By John D. Mitchell, JavaWorld.com, 06/01/96 Print Feedback 34 Comments Developers ..

آخر اخبارنا

Alex Amell
Mar 6, 2016
0

اقرا المزيد

Alex Amell
Mar 6, 2016
0

اقرا المزيد

Alex Amell
Mar 6, 2016
0

اقرا المزيد

اهم الفاعليات لهذا الشهر

شهادات العملاء

شركة الفالح للمقاولات

تغريدات تويتر

web3jba @web3jba 4h
"برنامج محاسبة: برنامج محاسبة الى كل الشركات التى تقوم بمشاريع ذات تكاليف كثيرة لديك الان… https://t.co/Y9aiYAxLTN"
عرض التغريدة
alkhateeb_groub @alkhateeb_groub 4h
"شاركنا باسم افضل برنامج محاسبة تعمل عليه ؟؟ مجموعة الخطيب للمحاسبة والتدقيق والتحكيم المالي والاستشارات الضريبية"
عرض التغريدة
edara_arabia @edara_arabia 4h
"مجانا : برنامج #محاسبة متكامل هذا البرنامج يمكنك من تسجيل الدخل والمصروفات وعمل #ميزانية الشهرية والسنوية... https://t.co/hiWH94zgk3"
عرض التغريدة
egydrem @egydrem 4h
"#إيجي_موب : برنامج حسابات رائع قد تدفع الكثير من المال فى برنامج معقد لن يفيدك فى شئ وقد تدفع القليل فى برنامج... http://t.co/mcFDTffZ7R"
عرض التغريدة

jsoup: Java HTML Parser

Example

Open source

Getting started

Development and support

Status

Cookbook contents

Introduction

Input

Extracting data

Modifying data

Cleaning HTML

مواضيع متعلقة

حل مشكلة خطا عند بدء الاباتشي unknown(): ubable to load dynamic library pgp\extensions\php_iconv.dll

ASP.net localization Hello, localized world!

Java Tip 10: Implement callback routines in Java

آخر اخبارنا

اهم الفاعليات لهذا الشهر

شهادات العملاء

انقلنا الي استخدام نظام آفاق لقد كانت نقلة نوعية لقسم الحسابات لدينا

شكرا آفاق على تسهيل العمل وربط الفروع مع نقاط البيع بدون اتصال ومع اتصال

من بين عدة انظمة للمحاسبة وقع اختيارنا على آفاق للمحاسبة لانه مناسب لطبيعة نشاطنا وانتشار فروعنا في مناطق مختلفة

آفاق للحسابات العامة نشكركم على هذا المنتج الرائع والي مزيد من التقدم والرقي لهذا المنتج الرائع

نقاط البيع واصدار الفواتير عبر نظام آفاق بالفعل هو الافضل مع تجربتنا لاكثر من نظام محاسبي كان آفاق الافضل من بينها

الشبكات الإجتماعية

الوسوم

تغريدات تويتر