Version 4.13.0 / June 03, 2025
❤️ Sponsor
HtmlUnit@mastodon | HtmlUnit@bsky | HtmlUnit@Twitter
- Overview
- Get it!
- Getting Started
- Features
- Vulnerabilities
- Sponsoring
- Contributing
- Last CI build
- License
- Development
- Some insights
HtmlUnit is a "GUI-less browser for Java programs". It models HTML documents and provides an API that allows you to invoke pages, fill out forms, click links, etc... just like you do in your "normal" browser.
It has fairly good JavaScript support (which is constantly improving) and is able to work even with quite complex AJAX libraries, simulating Chrome, Firefox or Internet Explorer depending on the configuration used.
HtmlUnit is typically used for testing purposes or to retrieve information from web sites.
Add to your pom.xml
:
<dependency>
<groupId>org.htmlunit</groupId>
<artifactId>htmlunit</artifactId>
<version>4.13.0</version>
</dependency>
Add to your build.gradle
:
implementation group: 'org.htmlunit', name: 'htmlunit', version: '4.13.0'
You can start here:
- Getting Started
- Introduction to HtmlUnit - Baeldung
- The Java Web Scraping Handbook A nice tutorial about webscraping with a lot of background information and details about HtmlUnit.
- Web Scraping Examples how to implement web scraping using HtmlUnit, Selenium or jaunt and compares them.
- The Complete Guide to Web Scraping with Java A small straightforward guide to web scraping with Java.
- How to test Jakarta Faces with HtmlUnit and Arquillian
- WebScraping.AI HtmlUnit FAQ
- Support for the HTTP and HTTPS protocols
- Support for cookies
- Ability to specify whether failing responses from the server should throw exceptions or should be returned as pages of the appropriate type (based on content type)
- Support for submit methods POST and GET (as well as HEAD, DELETE, ...)
- Ability to customize the request headers being sent to the server
- Support for HTML responses
- Wrapper for HTML pages that provides easy access to all information contained inside them
- Support for submitting forms
- Support for clicking links
- Support for walking the DOM model of the HTML document
- Proxy server support
- Support for basic and NTLM authentication
- Excellent JavaScript support
HtmlUnit can be used as a Selenium WebDriver-compatible browser through the htmlunit-driver. This integration allows you to use HtmlUnit as a headless browser option within Selenium test suites, providing fast execution without the overhead of launching a full browser instance.
Please have a look at the HtmlUnit Remote project if you like to use this driver from Selenium 4 Grid.
The htmlunit-jsoup library provides utilities
to bridge the gap between HtmlUnit and jsoup.
The HtmlUnitDOMToJsoupConverter
enables seamless integration between HtmlUnit's comprehensive
browser simulation capabilities and all the jsoup-based libraries,
allowing you to leverage the full ecosystem of jsoup tools
while maintaining HtmlUnit's JavaScript execution and dynamic content handling.
HtmlUnit satellite projects
- HtmlUnit on android
- Htmlunit - NekoHtml Parser
- HtmlUnit - CSSParser
- HtmlUnit - CSP
- or core-js our Rhino fork
HtmlUnit is used as the underlying "browser" by different Open Source tools like
- WebDriver
- Arquillian Drone
- Serenity BDD
- XLT
- FluentLenium
- WETATOR
- Selenium Foundation
- Spring Testing
- Selenide
- JWebUnit
- JSFUnit
- ...
HtmlUnit is used by many projects for automated web testing
- jenkins-test-harness
- Apache Shiro
- Apache Struts
- Quarkus
- Togglz
- Dataverse
- Janssen Project
- Apache TomEE
- Apache Maven Surefire
- JSCover
- Apache Jackrabbit
- Apache MyFaces
- JakartaEE TCK
- Jakarta Security
- OpenXava
- Cargo
- piranha cloud
- ...
Constantly updating and maintaining the HtmlUnit code base already takes a lot of time.
I would like to make 2 major extensions in the next few months
For doing this I need your sponsoring.
Pull Requests and all other Community Contributions are essential for open source software. Every contribution - from bug reports to feature requests, typos to full new features - are greatly appreciated.
Please try to keep your pull requests small (don't bundle unrelated changes) and try to include test cases.
The latest builds are available from our Jenkins CI build server
Read on if you want to try the latest bleeding-edge snapshot.
Add the snapshot repository and dependency to your pom.xml
:
<!-- ... -->
<repository>
<name>Central Portal Snapshots</name>
<id>central-portal-snapshots</id>
<url>https://central.sonatype.com/repository/maven-snapshots/</url>
<releases>
<enabled>false</enabled>
</releases>
<snapshots>
<enabled>true</enabled>
</snapshots>
</repository>
<!-- ... -->
<dependencies>
<dependency>
<groupId>org.htmlunit</groupId>
<artifactId>htmlunit</artifactId>
<version>4.14.0-SNAPSHOT</version>
</dependency>
<!-- ... -->
</dependencies>
<!-- ... -->
Add the snapshot repository and dependency to your build.gradle
:
repositories {
maven { url "https://central.sonatype.com/repository/maven-snapshots/" }
// ...
}
// ...
dependencies {
implementation group: 'org.htmlunit', name: 'htmlunit', version: '4.14.0-SNAPSHOT'
// ...
}
This project is licensed under the Apache 2.0 License
Checkout these pages on our website for detailed hints about starting with the development: