PHP download

Download the PHP package llm-html-extractor/symfony-bundle without Composer

On this page you can find all versions of the php package llm-html-extractor/symfony-bundle. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.

Table of contents
Download llm-html-extractor/symfony-bundle
More information about llm-html-extractor/symfony-bundle
Files in llm-html-extractor/symfony-bundle

Vendor llm-html-extractor
Package symfony-bundle
Short Description Symfony bundle for extracting structured data from HTML using LLM providers
License MIT

Keywords symfony parser bundle html extraction extractor scraping scrapper ai jina llm jina-reader

FAQ

After the download, you have to make one include require_once('vendor/autoload.php');. After that you have to import the classes with use statements.

Example:

If you use only one package a project is not needed. But if you use more then one package, without a project it is not possible to import the classes with use statements.

In general, it is recommended to use always a project to download your libraries. In an application normally there is more than one library needed.

Some PHP packages are not free to download and because of that hosted in private repositories. In this case some credentials are needed to access such packages. Please use the auth.json textarea to insert credentials, if a package is coming from a private repository. You can look here for more information.

Some hosting areas are not accessible by a terminal or SSH. Then it is not possible to use Composer.
To use Composer is sometimes complicated. Especially for beginners.
Composer needs much resources. Sometimes they are not available on a simple webspace.
If you are using private repositories you don't need to share your credentials. You can set up everything on our site and then you provide a simple download link to your team member.
Simplify your Composer build process. Use our own command line tool to download the vendor folder as binary. This makes your build process faster and you don't need to expose your credentials for private repositories.

Please rate this library. Is it a good library?

Example code of llm-html-extractor/symfony-bundle

Informations about the package symfony-bundle

LLM HTML Extractor Symfony Bundle

A powerful Symfony bundle for extracting structured data from HTML using LLM (Large Language Model) providers with a plugin architecture.

Features

LLM-Based Extraction: Uses LLM providers (starting with Jina Reader) to extract structured data from HTML
Type-Safe DTOs: Define extraction schemas using PHP attributes on your DTOs
Hybrid Extraction: Easily combine LLM extraction with code-based extraction - use AI for complex fields and DomCrawler/XPath for simple structured data
Extensible: Plugin architecture allows custom extractors for specific use cases
Cacheable: Built-in caching support for LLM responses
Logging: Optional logging for LLM requests/responses and cache operations
Configurable: Flexible configuration for different LLM providers and caching strategies

Installation

Configuration

Create or update config/packages/llm_html_extractor.yaml:

Alternatively, you can use an existing HTTP client service:

Using a Custom LLM Client

To use your own LLM client implementation, just set the client parameter to your service ID:

Your custom client must implement LlmHtmlExtractor\SymfonyBundle\Client\LlmClientInterface. The bundle will validate this during container compilation and throw a clear error if the interface is not implemented.

Logging

The bundle provides comprehensive logging for debugging and monitoring:

Request/Response Logging: When logs.enabled: true, all LLM requests and responses are logged at info level
Cache Operations: Cache hits and misses are logged when both caching and logging are enabled
Error Logging: Failed LLM requests are logged at error level with exception details

The decorators are applied in this order:

Base LLM Client (e.g., JinaReaderLlmClient)
LoggingLlmClient (if logs enabled) - logs requests/responses
CacheableLlmClient (if cache enabled) - logs cache hits/misses

This means logged requests show the actual LLM calls (cache misses), not cached responses.

Usage

1. Define Your Extraction DTO

2. Use the Extraction Handler

3. Create Custom Extractors (Optional)

For specific extraction needs, implement the FromHtmlExtractorInterface:

Supported LLM Providers

Currently supported:

Jina Reader (jinaai/readerlm-v2, jinaai/readerlm-v1.5)
- Uses vLLM OpenAI API standard endpoint (/openai/v1/chat/completions)
- Tested with Runpod serverless deployments
- Compatible with any vLLM deployment following the OpenAI API standard

License

MIT

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

All versions of symfony-bundle with dependencies

PHP Build Version

Package Version

Version 0.1 Release 19. Oct 2025
create-project require 0 people chose require and
0 people chose create-project.

Download

Download latest version of symfony-bundle from vendor llm-html-extractor

Requires php Version >=8.2
symfony/dependency-injection Version ^6.4|^7.0
symfony/config Version ^6.4|^7.0
symfony/http-kernel Version ^6.4|^7.0
symfony/http-client Version ^6.4|^7.0
symfony/property-access Version ^6.4|^7.0
symfony/property-info Version ^6.4|^7.0
symfony/serializer Version ^6.4|^7.0
symfony/cache Version ^6.4|^7.0
symfony/dom-crawler Version ^6.4|^7.0
symfony/yaml Version ^6.4|^7.0

Composer command for our command line client (download client) This client runs in each environment. You don't need a specific PHP version etc. The first 20 API calls are free. Standard composer command

The package llm-html-extractor/symfony-bundle contains the following files

Loading the files please wait ...