Back to home
Features

Everything a serious
scraping team needs.

Liizrd isn't just a scraping library. It's a full extraction platform with compliance, intelligence, and observability baked in from day one.

The Framework

What LIIZRD stands for.

Six foundational layers. One extraction platform built for teams that can't afford to cut corners.

LIIZRD acronym breakdown: Lossless, Intelligent, Immutable, Zero-Trust, Reliability, Data — the 6 foundational layers of the Liizrd framework
SOC2 Compliance

Your compliance team will thank you.

Liizrd generates a full audit trail for every HTTP request, data access event, and schema mutation. Built-in data lineage tracking means you can trace any extracted record back to its source — exactly what SOC2 Type II auditors need.

  • Automatic audit logging

    Every request, response, and data transformation is logged with timestamps, user context, and session IDs.

  • Data lineage tracking

    Trace any data point back to its origin URL, timestamp, and extraction rule.

  • Access controls

    Role-based access to spiders, outputs, and compliance reports.

  • Retention policies

    Configure data retention per-spider with automatic expiry and deletion logs.

  • Automated reports

    Generate SOC2-ready compliance reports on demand for auditors.

example.py
SOC2 Compliance
"token-keyword">import liizrd

"token-keyword">class AuditedSpider(liizrd.Spider):
    name = "products"
    
    "token-keyword">class="token-comment"># SOC2 profile: logs everything
    compliance = liizrd.SOC2Profile(
        log_level="full",
        data_lineage="token-keyword">True,
        retention_days=90,
        access_control=["team:engineering"],
    )
    
    "token-keyword">async "token-keyword">def parse(self, response):
        "token-keyword">class="token-comment"># Every "token-keyword">yield is audit-logged
        "token-keyword">yield {
            "product_id": response.css("h1::text").get(),
            "price": response.css(".price::text").get(),
        }
Why Liizrd

The only framework that takes compliance seriously.

Capability
Liizrd
Scrapy
Crawlee
Playwright
SOC2 compliance built-in
AI-powered field remapping
Entity persistence across time
Distributed tracing (zero config)
Real-time metrics dashboard
GDPR/CCPA handling
Hosted scheduler
Team access controls
Audit log export