Download the PHP package endless-creativity/elephant-php-neuron without Composer
On this page you can find all versions of the php package endless-creativity/elephant-php-neuron. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.
Download endless-creativity/elephant-php-neuron
More information about endless-creativity/elephant-php-neuron
Files in endless-creativity/elephant-php-neuron
Package elephant-php-neuron
Short Description Neuron AI FileDataLoader reader for .docx files, powered by elephant-php.
License BSD-2-Clause
Homepage https://github.com/endless-creativity/elephant-php-neuron
Informations about the package elephant-php-neuron
elephant-php-neuron
A Neuron AI FileDataLoader reader for .docx
documents, powered by elephant-php.
Drop it into a Neuron RAG pipeline and .docx files become embeddable
documents alongside the bundled PDF, HTML and plain-text readers.
Installation
Requires PHP 8.2+. No external binaries needed (unlike PdfReader).
Usage
Pass a directory and Neuron walks it, picking the right reader per extension; pass a single file to ingest just that one.
Output format
By default the reader returns plain text via
Converter::extractRawText() — paragraphs separated by "\n\n", no
markup. This is usually what you want for embeddings: less syntactic
noise, more semantic signal per token.
If you'd rather preserve headings, lists and links — for example because your splitter or post-processor relies on Markdown structure — request Markdown explicitly through the reader options:
The $options array is forwarded by FileDataLoader to every reader,
so the same flag is in effect for the whole loading pass.
Limitations
- Only OOXML
.docxis supported. Legacy binary.doc(Word 97–2003) is not handled by elephant-php and therefore not by this reader either. - Images embedded in the document are dropped during text extraction. This is intentional for RAG — embeddings are text-only.
- Conversion warnings emitted by elephant-php (
Result::messages) are currently silenced. If you need them, open an issue.
License
BSD-2-Clause. See LICENSE.
All versions of elephant-php-neuron with dependencies
endless-creativity/elephant-php Version ^0.2 || ^0.3
neuron-core/neuron-ai Version ^3.0