Data Engineer

Ukraine, Canada, Europe

We are seeking a detail-oriented and technically proficient Data Engineer to join our team. This role focuses on structuring, modeling, and analyzing client data, primarily for use in graph-based environments such as Neo4j.

The ideal candidate will have strong data modeling skills, proficiency in Python and database technologies, and the ability to work closely with clients to deliver actionable insights.

Key Responsibilities

Data Structuring & Contribution
– Analyze and interpret data files (e.g., CSV, PDF, Excel) received from clients, and understand their underlying data structures.
– Organize and structure raw data into clean, usable formats for downstream modeling and analysis, following domain-driven design (DDD) or data mesh principles.
– Learn and adopt the internal data platform’s workflows and tooling.
– Contribute to platform improvements and maintain alignment with evolving technical standards.
– Work on data governance, metadata management, and documentation of models.
– Work closely with the team lead and development team, primarily on customer-facing, consulting-style projects and platform improvement.

Data Modeling
– Design and prepare data schemas and models, primarily for use with the Neo4j graph database.
– Map client data to the defined Neo4j schema using Pydantic models.
– Define and manage schema evolution, normalization/denormalization, data integrity constraints, and indexing strategies.
– Strong general data modeling skills are essential (Neo4j/graph modeling experience is a plus but not required).

Data Analysis & Visualization
– Perform exploratory data analysis using Neo4j Bloom or similar visualization and BI tools.
– Present insights and visualizations to internal teams and clients to support understanding and data-driven decision-making.

Client Interaction & Communication
– Participate in customer-facing meetings to gather data requirements and present findings.
– Communicate clearly with clients to discuss datasets, schemas, and analysis results.
– Participate in standups and sync meetings, typically held between 8 am and 1 pm PST.

Required Experience

– Experience with knowledge graphs in both production and non-production projects.
– Hands-on experience with Neo4j is preferred. Experience with Amazon Neptune, OrientDB, JanusGraph, Azure Cosmos DB, or TigerGraph is also valuable.
– Candidates who have experimented with Neo4j and are willing to ramp up quickly in graph technologies are welcome.
– Proficiency in Python, including pandas, polars, pydantic, numpy, and the typing module, as well as working in Jupyter Notebook.
– Proficiency in SQL and Cypher queries.
– Strong background in conceptual, logical, and physical data modeling.
– Experience designing and implementing relational and non-relational (NoSQL/graph) schemas.
– Familiarity with modern AI tools, including LLMs, embeddings, agentic systems, and tooling for unstructured data processing and intelligent automation.

Nice to have

– Experience in one or more of the following domains is a strong advantage: Financial Services, Cybersecurity, Oil & Gas.
– Familiarity with domain-specific data structures, terminology, and challenges is highly beneficial.

Ready to rumble?

Send your CV or contact us here.