Retail company
Our client is an Indian company engaged in sales in the non-grocery segment
The main need of the Client was to find software that automatically generates arrays of financial and non-financial data of companies for subsequent use in machine learning models
The software allows to automatically generate massive of data on companies for subsequent use in machine learning models from a wide range of various open data sources. The downloaded data massive is analyzed for the presence of abnormal values and outliers. It also enables for constant updating of the generated data massive, as well as downloading information on new lists of companies. If necessary, it is also possible to form additional synthetic data sets consisting of artificially generated observations. The generated sets have the same statistical characteristics as the reference sets from initial data
Project Target
The software allows accumulating data arrays, as well as clearing their abnormal values and outliers, followed by updating the current information. As part of the interaction, the Client has satisfied the following needs:
- Automatic data collection for requested companies
- Regular updating of the data array for use in machine learning models
- Formation of synthetic data sets with similar characteristics
- The final format of the date sets in csv format for data science methods
As a result of using our product to generate datasets, the Client was able to collect a significant sample of data to improve the accuracy and quality of the machine learning models used





