Download the PHP package vgip/datanorm without Composer
On this page you can find all versions of the php package vgip/datanorm. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.
Table of contents
Download vgip/datanorm
More information about vgip/datanorm
Files in vgip/datanorm
Download vgip/datanorm
More information about vgip/datanorm
Files in vgip/datanorm
Please rate this library. Is it a good library?
Informations about the package datanorm
Data normalization
Data normalization from some open sources
Installation
System Requirements
You need PHP >= 7.4 but the latest stable version of PHP is recommended
Composer
Functionality list
- Transliteration from Ukrainian into English KMU 2010-01-27 #55
- Kyiv street getter from kga.gov.ua
Transliteration from Ukrainian into English KMU 2010-01-27 #55
Kyiv street getter from kga.gov.ua
Vgip\Datanorm\Parcer\Address\Ukr\Kyiv\StreetNameKga
Get array with normalized data from CSV file
Check and normalized street name data:
- Convert possible apostrophe symbols to one symbol (ʼ - 02BC).
- Check id (forbidden symbols, double). If error see to $this->warning.
- Check street type by whitelist. New type save to $this->warning and this->typeNotFound.
- Check Kyiv district name by whitelist. New Kyiv district name save to $this->warning and this->districtNotFound.
- Check the street names and normalized street names . (if data saved to $this->streetNormalization array)
- Generate $this->nameDouble array - save 2 or more double street name.
- Generate $this->nameList - all unique street names.
- Generate $this->typeCounter - quantity of all street types in Kyiv.
Result array from method getCsvAsArray():
- ['number'] - (int) serial number from file
- ['id'] - (int) identifier from file
- ['name_original'] - (string) street name from file
- ['name'] - (string) normalized street name
- ['type_name'] - (string) street type name from file
- ['type_key'] - (string) street type key
- ['district_string'] - (string) street districts from file
- ['district_list'] - (array) street districts ['district_key', 'district_key', ...]
- ['document_name'] - (string) Document on assigning the name of the object
- ['document_date'] - (string) Date of the document on assigning the name of the object
- ['document_number'] - (string) Number of the document on assigning the name of the object
- ['document_title'] - (string) The title of the document on the naming of the object
- ['place_description'] - (string) Location of the object in the city
- ['name_old'] - (string) Former name of the object
- ['type_old'] - (string)Former category (type) of the object
Example
Ukrainian language
Apostrophe
The resulting data will contain as ukrainian apostrophe symbol "ʼ" unicode symbol U+02BC. All other similar characters in source data (' - U+0027, ’ - U+2019, etc) will be replaced to ʼ (U+02BC). U+02BC - this symbol is used in the ukrainian domain name (ICANN).
Street name normalization
- Position and surname - Академіка Єфремова, Генерала Авдєєнка, Маршала Бірюзова
- Name and surname - Леоніда Бикова
- Family relationships and surname - Братів Зерових, Родини Рудинських
Versioning
Data normalization follows Semantic Versioning.
All versions of datanorm with dependencies
PHP Build Version
Package Version
Requires
php Version
>=5.6.0
The package vgip/datanorm contains the following files
Loading the files please wait ....