Gumbo 0.9.2
A C library for parsing HTML.
Gumbo HTML Parser

Gumbo provides a conformant, dependency-free implementation of the HTML5 parsing algorithm. It supports only UTF-8 input; to parse a document in a different encoding, first convert it to UTF-8 in a preprocessing step. The parser returns a parse tree built from the structs declared in this file.

Example:

GumboOutput* output = gumbo_parse(input);
do_something_with_doctype(output->document);
do_something_with_html_tree(output->root);
gumbo_destroy_output(&kGumboDefaultOptions, output);
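
The preprocessing step for non-UTF-8 input is left to the caller. As a minimal sketch, assuming the input is strictly ISO-8859-1 (where every byte maps directly to the Unicode code point of the same value), the conversion could look like the following. The helper name latin1_to_utf8 is hypothetical and not part of Gumbo; other encodings would need a general conversion library instead.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical helper, not part of the Gumbo API: converts `len` bytes of
 * ISO-8859-1 text into a newly allocated, NUL-terminated UTF-8 string.
 * Returns NULL if allocation fails. The caller owns the result and must
 * release it with free(). */
static char* latin1_to_utf8(const unsigned char* in, size_t len) {
  assert(in != NULL || len == 0);

  /* Each ISO-8859-1 byte expands to at most two UTF-8 bytes. */
  char* out = malloc(2 * len + 1);
  if (out == NULL) {
    return NULL;
  }

  size_t j = 0;
  for (size_t i = 0; i < len; ++i) {
    unsigned char c = in[i];
    if (c < 0x80) {
      /* ASCII range: encoded as a single, identical byte. */
      out[j++] = (char)c;
    } else {
      /* U+0080..U+00FF: two-byte sequence 110xxxxx 10xxxxxx. */
      out[j++] = (char)(0xC0 | (c >> 6));
      out[j++] = (char)(0x80 | (c & 0x3F));
    }
  }
  out[j] = '\0';
  return out;
}
```

The returned string can then be passed to gumbo_parse in place of the original buffer. It is safest to free it only after gumbo_destroy_output has been called, on the assumption that nodes in the parse tree may still point into the input text.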

HTML5 Spec:

http://www.whatwg.org/specs/web-apps/current-work/multipage/syntax.html