Blame doc/html/parsing.html

Packit Service b74dd5
Packit Service b74dd5
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
Packit Service b74dd5
<head>
Packit Service b74dd5
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
Packit Service b74dd5
<meta name="generator" content="Docutils 0.12: http://docutils.sourceforge.net/" />
Packit Service b74dd5
<title>Parsing XML and HTML with lxml</title>
Packit Service b74dd5
<link rel="stylesheet" href="style.css" type="text/css" />
Packit Service b74dd5
<script type="text/javascript">
Packit Service b74dd5
function trigger_menu(event) {
Packit Service b74dd5
    var sidemenu = document.getElementById("sidemenu");
Packit Service b74dd5
    var classes = sidemenu.getAttribute("class");
Packit Service b74dd5
    classes = (classes.indexOf(" visible") === -1) ? classes + " visible" : classes.replace(" visible", "");
Packit Service b74dd5
    sidemenu.setAttribute("class", classes);
Packit Service b74dd5
    event.preventDefault();
Packit Service b74dd5
    event.stopPropagation();
Packit Service b74dd5
}
Packit Service b74dd5
function hide_menu() {
Packit Service b74dd5
    var sidemenu = document.getElementById("sidemenu");
Packit Service b74dd5
    var classes = sidemenu.getAttribute("class");
Packit Service b74dd5
    if (classes.indexOf(" visible") !== -1) {
Packit Service b74dd5
        sidemenu.setAttribute("class", classes.replace(" visible", ""));
Packit Service b74dd5
    }
Packit Service b74dd5
}
Packit Service b74dd5
</script><meta content="width=device-width, initial-scale=1" name="viewport" /></head>
Packit Service b74dd5
<body onclick="hide_menu()">
Packit Service b74dd5
Packit Service b74dd5

Parsing XML and HTML with lxml

Packit Service b74dd5
Packit Service b74dd5

lxml provides a very simple and powerful API for parsing XML and HTML. It

Packit Service b74dd5
supports one-step parsing as well as step-by-step parsing using an
Packit Service b74dd5
event-driven API (currently only for XML).

Packit Service b74dd5
Packit Service b74dd5

Contents

Packit Service b74dd5
    Packit Service b74dd5
  • Parsers
    • Packit Service b74dd5
    • Parser options
    • Packit Service b74dd5
    • Error log
    • Packit Service b74dd5
    • Parsing HTML
    • Packit Service b74dd5
    • Doctype information
    • Packit Service b74dd5
      Packit Service b74dd5
      Packit Service b74dd5
    • The target parser interface
    • Packit Service b74dd5
    • The feed parser interface
    • Packit Service b74dd5
    • Incremental event parsing
      • Packit Service b74dd5
      • Event types
      • Packit Service b74dd5
      • Modifying the tree
      • Packit Service b74dd5
      • Selective tag events
      • Packit Service b74dd5
      • Comments and PIs
      • Packit Service b74dd5
      • Events with custom targets
      • Packit Service b74dd5
        Packit Service b74dd5
        Packit Service b74dd5
      • iterparse and iterwalk
        • Packit Service b74dd5
        • iterwalk
        • Packit Service b74dd5
          Packit Service b74dd5
          Packit Service b74dd5
        • Python unicode strings
          • Packit Service b74dd5
          • Serialising to Unicode strings
          • Packit Service b74dd5
            Packit Service b74dd5
            Packit Service b74dd5
            Packit Service b74dd5
            Packit Service b74dd5

            The usual setup procedure:

            Packit Service b74dd5
            >>> from lxml import etree