Data Scientist in IT — CIS and Europe market
Data Scientist — the second-largest ML/AI role after ML Engineer. Focus is NOT on production infrastructure but on business problem framing: understand the business task, design the experiment, build the model/insight, run A/B tests, communicate with stakeholders. Role family: Junior Data Scientist (ML entry — usually after Analyst), Data Scientist (mid — owns product hypothesis from analysis to model), Senior Data Scientist (end-to-end ownership + multi-team collaboration + A/B-test mastery), Lead Data Scientist/Principal DS (DS strategy for a product area, hiring + mentoring), Product Data Scientist (deep product focus, growth experiments). Stack: Python (dominates), SQL (mandatory — DS reads SQL fluently and writes complex window functions + CTEs), pandas+NumPy+Polars (data manipulation), scikit-learn+XGBoost+LightGBM+CatBoost (classical ML — the core arsenal), PyTorch/TensorFlow (deep learning — bonus, not mandatory for every DS role), statsmodels+SciPy (statistics), Jupyter+VS Code (development), Tableau/Looker/Power BI/Apache Superset/Metabase (visualisation for stakeholders), matplotlib+seaborn+plotly (in-notebook plots), MLflow/Weights & Biases (experiment tracking), A/B-testing platforms (internal or Statsig/Eppo/Optimizely/GrowthBook). According to Zorky CRM, 435 active openings with a median salary of $5985/mo. Top stack: python, go, sql, spark, rust. 76.2% remote. Data Scientist — $4,500-8,000/mo medians, Senior with domain expertise (fintech/healthcare/retail) — premium $7,000-12,000+.
Comparison with other specializations
The AI / ML / Data Science direction contains 6 specializations. The current one (Data Scientist) is highlighted in blue — compare it with its neighbors by the number of open jobs and median salary.
Demand trend
Data Scientist — the stable second-largest AI/ML segment after ML Engineer. Growth 2026 drivers: product companies (Yandex / Avito / OZON — recsys / pricing / fraud), banks (Sber / Tinkoff / Alfa — credit scoring / cross-sell / fraud detection), retail (X5 / Wildberries — demand forecasting / pricing), international product startups via remote channels.
How many new jobs appear each week.
Seniority distribution — trend
How the share of Junior/Middle/Senior/Lead in open jobs shifts week over week. A trend toward Senior usually signals a mature specialization where companies look for ready-made talent; the opposite — a rise in Junior — signals expansion and ground-up team building.
Share of each level in % of all jobs with a stated grade per week.
Salary by level
DS salary ladder: Junior $7796, Middle $4875, Senior $5985, Lead $10506 /mo. Junior — typical via Data Analyst (1-2 years) → Junior DS. Career flow: Junior (1-2 years) → Middle (2-3 years) → Senior → either Lead / Principal DS (strategy + hiring), ML Engineer pivot (technical track), Product Manager pivot, or Founder.
Median salary (USD/month) at each grade plus the jump vs the previous one.
Biggest salary jump — between Senior and Lead (+45.6%).
Salary distribution — trend
The median DS salary — $5985/mo. Most jobs sit at $3-7K (entry-mid). $8K+ — Senior with domain expertise (fintech / healthcare / retail). $10K+ — Senior at international tech companies (Stripe / Airbnb / DoorDash / Spotify / Pinterest / Wise / Revolut). $15K+ — Senior+ Big Tech (Google / Meta / Amazon / Apple DS teams).
What share of jobs each price band holds week over week.
62% of jobs are in the $5–8K range (the core market). High-end $8K+ segment: 24% — usually US-remote or senior-international roles.
Hiring geography
The leader by DS job count is EN (185 positions). Russia — Yandex DS + Sber DS + Tinkoff DS + Avito DS + OZON DS + banks + retail (X5 / WB) + EPAM DS Practice dominate. Poland — DS-friendly EU hub. International remote via Stripe / Airbnb / DoorDash / Spotify / Wise / Revolut.
Job distribution by country.
These numbers reflect the distribution across the sources we parse. Poland often looks dominant because of dense NoFluffJobs / JustJoin.it / Pracuj coverage — the Polish IT market is genuinely large, but in our sample its share is overweighted relative to the real volume of all IT jobs in the region. Same caveat for other top countries: this is «where our parsers look», not «the true size of the market».
Remote / Hybrid / Office — trend
76.2% of DS jobs are remote or hybrid. DS work — cloud data warehouse + Jupyter / VS Code. Outsourcers (EPAM / Luxoft DS Practice) — almost always remote. Russian banks — hybrid/office data security. International tech companies — full-remote standard.
How the share of each work format shifts week over week.
89% — remote. Specialisation is well-adapted to remote format.
Top in-demand technologies
Top DS stack 2026: Python (monopoly), SQL mastery, pandas + NumPy + Polars (data), scikit-learn (classical), XGBoost + LightGBM + CatBoost (gradient boosting workhorses), statsmodels + SciPy (statistics), Jupyter + VS Code, matplotlib + seaborn + plotly (in-notebook), Tableau / Looker / Power BI / Apache Superset / Metabase (stakeholder dashboards), MLflow / W&B (experiment tracking), A/B testing platforms (Statsig / Eppo / Optimizely / GrowthBook), causal inference (DoWhy / EconML — Senior bonus), Bayesian (PyMC — Senior bonus), Prophet / NeuralProphet / statsforecast (time series), Hugging Face (LLM-aware DS).
Technology combinations
Common pairs: Python + pandas + sklearn + XGBoost + Jupyter, SQL + Python + Tableau, LightGBM + statsmodels + scipy + matplotlib, Snowflake + dbt + Python (modern stack), Databricks + PySpark + MLflow, Pinecone / Weaviate + Hugging Face (LLM-aware DS workflow), DoWhy + EconML + pandas (causal inference). Learning roadmap: SQL mastery → Python + pandas → statistics → classical ML (sklearn / XGBoost) → A/B testing mastery → visualisation & storytelling → domain specialisation → causal inference (Senior).
Which pairs of technologies appear together most often in a single job.
Where we see these jobs
DS jobs: hh.ru, Habr Career, getmatch, Djinni, LinkedIn (huge international DS segment), NoFluffJobs / JustJoin.it (Poland), Telegram (@datasciencejobs, @ds_chat, @ds_chat_jobs, @ml_jobs, @mljobs_ru, @aijobs, @jobsforaiml), career pages of EPAM DS Practice / Luxoft DS / Andersen DS, specialised boards aijobs.net + ai-jobs.net + builtin.com/jobs/data-science + kaggle.com/jobs, Toptal network, AI-startup career pages.
Data Scientist vs other directions
Data Scientist overlaps with Data Analyst (entry direction), ML Engineer (production ML — deployment / scaling), Research (novel methodology + papers), Product Manager (product thinking + A/B test mastery). Comparison with ml-engineer/research/mlops — in the SiblingSubnichesChart above.
Volume of open jobs across IT directions.
Latest jobs
Latest open DS jobs — the most recent 10 positions with adequate description quality. The full list is in our CRM or via the "see all" link below.
What we can offer
If you work with Data Scientist jobs or you're in this role yourself — we can close a specific task. Pick a format, leave a contact — we reply within 24 hours.
Frequently asked questions
The most common questions about Data Scientist: pay, DS vs Analyst vs ML Engineer, sklearn vs PyTorch vs XGBoost — when to use which (10-step decision tree), Product DS difference, remote, how to start (6-12 months from Analyst), Senior skills (A/B testing + causal inference + domain expertise). Answers recompute automatically.
How much does a Data Scientist earn in 2026?
The median Data Scientist salary is $5985/mo per Zorky CRM data (435 active jobs). Junior $7796/mo, Middle $4875/mo, Senior $5985/mo, Lead $10506/mo. DS — a stable segment, but growth is lower than ML Engineer due to less rare-skill requirements (SQL + sklearn — a broad entry pool). Senior DS with domain expertise (fintech credit scoring / healthcare modelling / retail demand forecasting) — +25-40% premium. Senior DS at US outsource (EPAM/Luxoft on enterprise analytics) — $6,000-10,000. Lead / Principal DS — $8,000-12,500. Product DS at US tech startups (Stripe/Airbnb/DoorDash analogues) — $8,000-14,000+ Senior. International remote DS (via Toptal network) — $7,000-13,000+ Senior. Big Tech (Google / Meta) Senior DS — $15,000-25,000+ for specific cases.
What does a Data Scientist Junior, Middle, Senior, or Lead earn?
Salary ladder (median USD/mo): Junior $7796/mo, Middle $4875/mo, Senior $5985/mo, Lead $10506/mo. Junior DS — the most accessible entry into the ML/AI direction (typical path: Analyst 1-2 years → Junior DS). The Junior → Middle jump — after the first production model + first independent A/B test. Middle → Senior — multi-project ownership + business impact ownership (revenue / retention / conversion lift documented). Senior → Lead / Principal — DS strategy for a product area + hiring 3-7 DS + mentoring. Career flow: Data Analyst (1-2 years) → Junior DS (1-2 years) → Middle (2-3 years) → Senior → either Lead / Principal DS (deep), ML Engineer pivot (more technical track), Product Manager pivot, or Founder.
How much do Data Scientists earn in Moscow, St Petersburg, remote?
Moscow Senior DS — $5,000-8,500/mo (Yandex — the largest DS employer in Russia: Search analytics, Market recsys, Auto.ru, Dzen; Sber DS; Tinkoff DS; Avito DS; OZON DS; Wildberries DS; VK / Mail.ru DS; X5 Group DS; MTS Big Data DS; Alfa-Bank / Raiffeisen DS). St Petersburg $4,500-7,500 (JetBrains analytics, EPAM SPb). Minsk/Kyiv $4,000-7,000 Senior. Poland €5,000-8,500 gross Senior. Germany €65-95K/yr Senior. 76.2% remote. Outsourcers (EPAM / Luxoft / Andersen DS Practice) — almost always remote, $5,500-9,500 Senior on US projects. International tech companies (via LinkedIn / hired.com / Toptal) — $7,000-13,000+ Senior for Russian-speakers on remote with English. Domain-expertise premiums: fintech (credit scoring / fraud) +20-30%, healthcare (medical ML, FDA compliance) +25-35%, retail (demand forecasting / pricing) +15-25%.
What stack does a Data Scientist most often need?
Top 5: python, go, sql, spark, rust. Python — monopoly (R has almost disappeared from new 2026 jobs, remaining in healthcare/biostats legacy). SQL mastery — mandatory: complex queries with window functions + CTEs + JOIN mastery + query optimisation (DS reads execution plans). Data manipulation: pandas (core), NumPy (numerical), Polars (modern, rising — 10-100× faster for big data), PyArrow. Classical ML — the core arsenal: scikit-learn (model selection / preprocessing / pipelines / metrics), XGBoost + LightGBM + CatBoost (gradient boosting — workhorse for tabular data), statsmodels (regression + time series + statistical tests). Statistics: scipy.stats, statsmodels, causal inference (DoWhy, EconML — Senior bonus). Visualisation: matplotlib + seaborn + plotly (in-notebook), Tableau / Looker / Power BI / Apache Superset / Metabase (for stakeholders). A/B testing: scipy.stats / statsmodels for analysis, internal platforms or Statsig / Eppo / Optimizely / GrowthBook. Understanding of power analysis, sample size calculation, multiple testing correction, sequential testing. Deep Learning (bonus, not mandatory): PyTorch / TensorFlow for CV / NLP DS tasks. Experiment tracking: MLflow or Weights & Biases. BI tools / dashboards: Tableau / Looker / Apache Superset / Metabase / Mode / Hex. Notebooks: Jupyter + VS Code (the 2026 market is migrating from raw Jupyter to hybrid). Cloud DS workbenches: AWS SageMaker Studio / GCP Vertex AI Workbench / Databricks / Snowflake Snowpark. Senior extras: causal inference (DoWhy / EconML / CausalImpact), uplift modelling, Bayesian modelling (PyMC / Stan), time series (Prophet / NeuralProphet / statsforecast).
Data Scientist vs Analyst vs ML Engineer — what's the difference?
Data Analyst — focus on reporting + dashboards + ad-hoc analysis. SQL + Excel + Tableau / Looker / Power BI. Less ML-heavy. Pay $2,500-5,000. Fits entry into the data direction. Data Scientist (this page) — focus on business problem framing + statistical modelling + A/B testing + insights communication. SQL deep + Python + sklearn + XGBoost + statsmodels + visualisation. Pay $4,500-8,000. Does ML, but doesn't own production infrastructure (deployment / serving / monitoring — that's the ML Engineer). ML Engineer — focus on production ML pipelines + deployment + scaling + LLM/RAG infrastructure. Stack DS + Docker / K8s + MLflow + Airflow + cloud-managed ML. Pay $5,500-9,500. See ML Engineer. Career flow: Analyst → DS Junior → DS Middle/Senior. DS ↔ ML Engineer pivot in 3-6 months (learn MLOps + Docker / K8s + cloud ML). DS ↔ Product Manager pivot in 2-4 months (DS already owns product thinking + A/B testing — add roadmap planning + stakeholder management). DS → Research Engineer / Scientist — requires PhD or 3+ years of deep specialisation.
Sklearn vs PyTorch vs XGBoost — when to use which?
Decision tree for picking the DS toolkit in 2026: 1) Tabular data, classification/regression, <1M rows → scikit-learn (LogisticRegression / RandomForest / GradientBoosting). Pipelines + preprocessing — sklearn wins for simple integration. 2) Tabular data, >100K rows, performance critical → XGBoost / LightGBM / CatBoost. The standard 2026 workhorse for tabular tasks: credit scoring, churn prediction, conversion, fraud detection, demand forecasting. LightGBM — fastest, XGBoost — most mature ecosystem, CatBoost — best for categorical features without preprocessing. 3) Time series → Prophet / NeuralProphet (Facebook) or statsforecast (Nixtla — state-of-the-art classical) or ML approach (LightGBM on feature-engineered time series — often beats specialised libraries). 4) NLP / text classification → Hugging Face Transformers (fine-tune small BERT/DeBERTa/RoBERTa) or scikit-learn TF-IDF + LogisticRegression if <10K docs (simpler baseline). 5) Computer Vision → PyTorch + torchvision (image classification / detection / segmentation). 6) LLM / RAG / generative → Hugging Face + LangChain + LlamaIndex + OpenAI / Anthropic / Cohere APIs. For production — vLLM / TGI. 7) Recommender systems → LightGBM / XGBoost on feature-engineered users × items (the 2026 workhorse), implicit / LightFM (collaborative filtering classics), PyTorch deep recsys (NeuMF / Two-Tower / Transformer4Rec). 8) Causal inference / uplift modelling → DoWhy / EconML / CausalML. 9) Bayesian modelling (when uncertainty quantification matters — medical / finance) → PyMC / Stan / NumPyro. 10) Anomaly detection → PyOD + Isolation Forest + LightGBM-based approaches. Default 2026: 80% of tabular tasks are solved by LightGBM / XGBoost / CatBoost; deep learning — for CV / NLP / recsys.
Can Data Scientists work remotely?
Yes, 76.2% of DS jobs are full-remote or hybrid. DS work — Jupyter + cloud data warehouse (Snowflake / BigQuery / Databricks / Redshift) + dashboards. Outsourcers (EPAM / Luxoft / Andersen DS Practice) — almost always remote on US projects. Russian product companies (Yandex / Sber / Tinkoff / Avito / OZON DS) — hybrid or remote after probation. Russian banks (Sber / VTB / Alfa DS) — hybrid/office due to data security compliance. International tech companies — full-remote standard. Relocant hubs: Poland (Warsaw/Krakow — DS-friendly), Germany (Berlin / Munich), Canada (Toronto / Vancouver), Serbia, Georgia, UAE. English for international DS remote — must (premium +25-40%, and DS roles require stakeholder communication — English audible).
How is Product Data Scientist different from general DS?
Product Data Scientist — a DS with deep product focus + ownership of growth experiments + retention modelling + feature-impact analysis. Works closely with PM and Engineering. Stack is the same, but the focus: A/B-test mastery (multi-variant experiments, sequential testing, multiple testing correction, power analysis), growth metrics (DAU/MAU/retention curves/funnel analysis), causal inference for observational data. Pay comparable to Senior DS (median), but at US product startups Product DS often exceeds general DS Senior thanks to business-impact ownership. General Data Scientist — can be in any domain (banking risk, retail demand, manufacturing quality). Research DS (or just "DS" in a research context) — focus on novel modelling approaches + papers (often PhD track). See also Research Engineer / Scientist. Career choice: Product DS if product impact is interesting and you enjoy stakeholder communication, general DS for domain flexibility, Research DS for new methodology research. All three are legitimate Senior+ paths.
Which companies actively hire Data Scientists?
At the top: Yandex, Sber, Avito. Yandex DS — the largest employer in Russia (Search analytics, Market recsys, Auto.ru, Dzen, Drive, Yandex.GO analytics, Music recsys). Sber DS (banking analytics, risk modelling, marketing analytics, GigaChat product analytics). Tinkoff DS (credit scoring + transaction analytics + cross-sell modelling). Avito DS (recsys + ranking + ML pricing + fraud detection). OZON DS (recsys + demand forecasting + supply chain). Wildberries DS (recsys + pricing). VK / Mail.ru DS (feed recsys + AI for products). X5 Group DS (retail analytics — pricing / inventory / promo effectiveness). MTS Big Data DS (telco analytics + insights products). Alfa-Bank / Raiffeisen / VTB DS. JetBrains (product analytics + ML for IDE features). Outsourcers: EPAM DS Practice (the largest DS outsource in CIS), Luxoft DS, Andersen DS, DataArt DS on US projects. International tech companies (premium full-remote): Stripe, Airbnb, DoorDash, Spotify, Pinterest, Lyft, Square / Block, Wise, Revolut. Y Combinator startups DS — premium remote. Big Tech (Google / Meta / Amazon / Apple) Senior DS — top $15,000-25,000+ for Russian-speaking Seniors with English.
Where to start in Data Science in 2026?
Roadmap: 1) SQL mastery — must-have (window functions / CTEs / JOIN strategy / query optimisation). Book "SQL for Data Scientists" Tanimura. Course "Mode SQL Tutorial" (free). 2) Python for DS: pandas + NumPy mastery, not just syntax but performance patterns (vectorisation, .apply pitfalls). Book "Python for Data Analysis" McKinney. 3) Statistics: descriptive + inferential + hypothesis testing + ANOVA + linear regression deep. Course "Statistical Learning" Hastie / Tibshirani (StanfordOnline — free). Book "Practical Statistics for Data Scientists" Bruce. 4) Classical ML: scikit-learn mastery (pipelines / cross-val / metrics trade-offs / calibration), XGBoost / LightGBM / CatBoost (gradient boosting workhorses). Andrew Ng "Machine Learning Specialization" (Coursera) + "Introduction to Statistical Learning" Hastie (ISL — free PDF). 5) A/B testing mastery — power analysis, sample size, sequential testing, multiple testing correction, CUPED variance reduction. Books: "Trustworthy Online Controlled Experiments" Kohavi (must-read), "Statistical Methods in Online A/B Testing" Georgi. 6) Visualisation & storytelling — matplotlib + seaborn + plotly for notebooks + Tableau / Looker / Power BI / Apache Superset for stakeholders. Books: "Storytelling with Data" Knaflic, "The Visual Display of Quantitative Information" Tufte (classic). 7) Domain specialisation — pick one of fintech / healthcare / retail / telco / e-commerce and study domain-specific challenges. 8) Deep learning bonus: PyTorch basics for CV / NLP DS tasks. 9) Causal inference (Senior bonus): "The Effect" Huntington-Klein (free online textbook), DoWhy / EconML libraries. 10) Pet project: end-to-end DS project on Kaggle (data exploration → feature engineering → model selection → calibration → business presentation) + 1 A/B test simulation project. RU courses: Karpov.Courses "Data Scientist", Otus "Data Scientist", SkillFactory "DS Professional", Yandex.Practicum "Data Science", SberUniversity — School21 AI. International (EN): DeepLearning.AI (Andrew Ng), fast.ai, Coursera IBM Data Science Professional. Analyst → Junior DS — 6-12 months. Backend Middle → Junior DS — 8-14 months (math + stats gap).
How many Data Scientist jobs are open across CIS and Europe?
435 active open DS positions. Geography: EN, 🇵🇱 Poland, INT. Sources: hh.ru, Habr Career, getmatch, Djinni, LinkedIn (huge international DS segment), NoFluffJobs / JustJoin.it (Poland), Telegram (@datasciencejobs, @ds_chat, @ds_chat_jobs, @ml_jobs, @mljobs_ru, @aijobs, @jobsforaiml), career pages of EPAM / Luxoft / Andersen DS Practice, specialised boards (aijobs.net, ai-jobs.net, builtin.com/jobs/data-science, kaggle.com/jobs), Toptal network. The real market is broader thanks to a huge international remote segment (Stripe / Airbnb / DoorDash / Spotify / Wise / Revolut DS teams — full-remote-friendly). Time to close a Senior DS role — 4-10 weeks (faster than ML Engineer thanks to broader entry pool).
What skills does a Senior Data Scientist need?
A Senior DS owns the full cycle from business problem to production model + technical leadership. SQL mastery: window functions / CTEs / complex JOINs / query optimisation / reading execution plans. Python for DS: pandas / NumPy / Polars performance patterns, vectorisation, profile-driven optimisation. Statistics mastery: hypothesis testing depth, Bayesian thinking, causal inference (DoWhy / EconML), Bayesian modelling basics (PyMC). Classical ML mastery: deep feature engineering, model selection rationale, calibration, business-aligned metrics, gradient boosting hyperparameter mastery (XGBoost / LightGBM / CatBoost). A/B testing mastery: power analysis, sequential testing, multiple testing correction, CUPED variance reduction, network effects handling, novelty/primacy effects, switchback experiments for marketplaces, geo-experiments. "Trustworthy Online Controlled Experiments" Kohavi — the desk book. Deep Learning (bonus): PyTorch for CV / NLP DS tasks. LLM-aware DS: prompt engineering, RAG basics, LLM-as-judge for evaluation, LLM applications in DS workflow (data exploration / synthetic data / labelling assistance). Visualisation & communication: storytelling-with-data mastery — convert results into business decisions for non-technical stakeholders. Cloud DS: one of AWS SageMaker / GCP Vertex AI / Databricks / Snowflake deeply. Soft: stakeholder management (product / business / engineering), technical writing (analysis docs, post-experiment reports, decision memos), mentoring Middle DS, hiring (interviewing DS candidates), executive communication. Domain expertise: deep understanding of one-two domains (fintech / healthcare / retail / telco / e-commerce) — the main premium driver for Senior+. English for Senior+ MUST — DS roles are intensely stakeholder-facing, and most materials / papers / community are in English.
Similar specializations
Methodology
- Data period: in the hero and copy — the last 3 months. In the charts — the full available observation period (since parsers were launched, usually 2-3 months).
- Data is collected automatically from 1000+ sources — Telegram channels and job boards across CIS and Europe.
- Only live open jobs with a clear description are counted. Spam and duplicates are filtered out.
- Salaries are converted to USD/month at the current rate. Outlier values (lt;500 or gt;50K) are filtered out.
- Levels are normalized: Mid → Middle, Intern/Trainee → Junior, Principal/Staff/Expert → Lead.
- The first 2 weeks of data (parser ramp-up period) are not shown in the charts.
- Data is recomputed every day.
Authorship and citation
Analytics prepared by Zorky Research Team. Last updated: May 29, 2026 at 5:40 PM.
Data sources and methodology
Data is collected automatically from 1000+ sources — Telegram job channels and job boards across CIS and Eastern Europe (HH, Habr Career, Djinni, DOU, NoFluffJobs, JustJoin.it, Pracuj.pl and others). Parsing runs 24/7, duplicates are filtered by description and URL, salary outliers are stripped. Detailed methodology — on the "How it works" page.
Zorky CRM (2026). Data Scientist in IT: CIS and Europe market. Accessed: 5/29/2026. URL: https://zorky.tech/en/research/ml