<aside>
🔥 Я могу вам предложить нечто большее. То!, что больше соответствует вашим талантам!
</aside>
InData Labs https://indatalabs.com/ is a data science firm and AI-powered solutions provider. Our main focus lies in machine learning and deep learning solutions, as well as building high-load data processing systems.
Currently, we are looking for a Data Scientist / Machine Learning Engineer who will be a part of the general-purpose data science team and work with tasks covering a wide variety of business needs with a soft focus on NLP applications.
In this position, you will work with multiple data sources (usually textual, numerical and time-related data), huge and small datasets to develop, validate and deploy machine learning models, tune their performance & integrate them into data processing pipelines.
Join our team of world-class data scientists and data engineers, and challenge yourself with some of the most pressing data science and engineering tasks of modern times!
Responsibilities:
- Deal with both structured and unstructured data, collaborate with data engineers on defining data storage formats, state data collection requirements;
- Not only solve technical tasks but understand business needs and offer appropriate solutions, describe a chosen approach to non-technical people;
- Set up reproducible experiments: selection, training, validation and optimization of machine learning models, evaluation of their quality in business-related terms;
- Integrate data preprocessing and model inference into general data processing pipelines;
- Research new tools, papers, etc. in the machine learning area.
Requirements:
- Strong knowledge and deep understanding of
- Сlassical machine learning (linear models, decision trees, ensembles for classification and regression tasks, clustering and dimensionality reduction)
- Main concepts and stages of the modelling process (validation scheme, regularization, overfitting and generalization, data leaks, feature selection, etc.)
- Experience with Python scientific, visualization and ML-related libraries (numpy, scipy, scikit-learn, etc.)