decruft 0.1.2

Extract clean, readable content from web pages
Documentation
<!DOCTYPE html>
<html>
<head>
  <meta charset="utf-8" />
  <title>Data Table Test</title>
</head>
<body>
  <article>
    <h1>Programming Language Comparison</h1>
    <p>Here is a comparison of popular programming languages:</p>
    <table>
      <thead>
        <tr>
          <th>Language</th>
          <th>Year</th>
          <th>Typing</th>
          <th>Primary Use</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <td>Python</td>
          <td>1991</td>
          <td>Dynamic</td>
          <td>General purpose, data science</td>
        </tr>
        <tr>
          <td>Rust</td>
          <td>2015</td>
          <td>Static</td>
          <td>Systems programming</td>
        </tr>
        <tr>
          <td>TypeScript</td>
          <td>2012</td>
          <td>Static</td>
          <td>Web development</td>
        </tr>
        <tr>
          <td>Go</td>
          <td>2009</td>
          <td>Static</td>
          <td>Cloud infrastructure</td>
        </tr>
      </tbody>
    </table>
    <p>Each language has its strengths and trade-offs.</p>

    <h2>Without thead</h2>
    <table>
      <tr>
        <td>Name</td>
        <td>Score</td>
      </tr>
      <tr>
        <td>Alice</td>
        <td>95</td>
      </tr>
      <tr>
        <td>Bob</td>
        <td>87</td>
      </tr>
    </table>
  </article>
</body>
</html>