PHP download

Download the PHP package onoi/tesa without Composer

On this page you can find all versions of the php package onoi/tesa. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.

Table of contents
Download onoi/tesa
More information about onoi/tesa
Files in onoi/tesa

Vendor onoi
Package tesa
Short Description A simple library to sanitize text elements
License GPL-2.0+
Homepage https://github.com/onoi/tesa

Keywords transliteration

FAQ

After the download, you have to make one include require_once('vendor/autoload.php');. After that you have to import the classes with use statements.

Example:

If you use only one package a project is not needed. But if you use more then one package, without a project it is not possible to import the classes with use statements.

In general, it is recommended to use always a project to download your libraries. In an application normally there is more than one library needed.

Some PHP packages are not free to download and because of that hosted in private repositories. In this case some credentials are needed to access such packages. Please use the auth.json textarea to insert credentials, if a package is coming from a private repository. You can look here for more information.

Some hosting areas are not accessible by a terminal or SSH. Then it is not possible to use Composer.
To use Composer is sometimes complicated. Especially for beginners.
Composer needs much resources. Sometimes they are not available on a simple webspace.
If you are using private repositories you don't need to share your credentials. You can set up everything on our site and then you provide a simple download link to your team member.
Simplify your Composer build process. Use our own command line tool to download the vendor folder as binary. This makes your build process faster and you don't need to expose your credentials for private repositories.

Please rate this library. Is it a good library?

Example code of onoi/tesa

Informations about the package tesa

Tesa (text sanitizer)

The library contains a small collection of helper classes to support sanitization of text or string elements of arbitrary length with the aim to improve search match confidence during a query execution that is required by Semantic MediaWiki project and is deployed independently.

Requirements

PHP 5.3 / HHVM 3.5 or later
Recommended to enable the ICU extension

Installation

The recommended installation method for this library is by adding the following dependency to your composer.json.

Usage

SanitizerFactory is expected to be the sole entry point for services and instances when used outside of this library
IcuWordBoundaryTokenizer is a preferred tokenizer in case the ICU extension is available
NGramTokenizer is provided to increase CJK match confidence in case the back-end does not provide an explicit ngram tokenizer
StopwordAnalyzer together with a LanguageDetector is provided as a means to reduce ambiguity of frequent "noise" words from a possible search index
Synonymizer currently only provides an interface

Contribution and support

If you want to contribute work to the project please subscribe to the developers mailing list and have a look at the contribution guidelinee. A list of people who have made contributions in the past can be found here.

Tests

The library provides unit tests that covers the core-functionality normally run by the continues integration platform. Tests can also be executed manually using the composer phpunit command from the root directory.

Release notes

0.1.0 Initial release (2016-08-07)
- Added SanitizerFactory with support for a
- Tokenizer, LanguageDetector, Synonymizer, and StopwordAnalyzer interface

Acknowledgments

The Transliterator uses the same diacritics conversion table as http://jsperf.com/latinize (except the German diaeresis ä, ü, and ö)
The stopwords used by the StopwordAnalyzer have been collected from different sources, each json file identifies its origin
CdbStopwordAnalyzer relies on wikimedia/cdb to avoid using an external database or cache layer (with extra stopwords being available here)
JaTinySegmenterTokenizer is based on the work of Taku Kudo and his tiny_segmenter.js
TextCatLanguageDetector uses the wikimedia/textcat library to make predictions about a language

License

GNU General Public License 2.0 or later.

All versions of tesa with dependencies

PHP Build Version

Package Version

Version 0.1.0 Release 07. Aug 2016
create-project require 0 people chose require and
0 people chose create-project.

Download

Download latest version of tesa from vendor onoi

Requires php Version >=5.3.2
ext-mbstring Version *
wikimedia/cdb Version ~1.0
wikimedia/textcat Version ~1.1

Composer command for our command line client (download client) This client runs in each environment. You don't need a specific PHP version etc. The first 20 API calls are free. Standard composer command

The package onoi/tesa contains the following files

Loading the files please wait ....