Data Engineer [Ai & Data]

HeyDonto AI API


Fecha: hace 2 semanas
ciudad: Chihuahua, Chihuahua
Tipo de contrato: Tiempo completo
WE ARE LOOKING FOR DATA ENGINEER AI & DATA¡HeyDonto is seeking a talentedData Engineerto join our team and play a vital role in the development of the HeyDonto Data Mapper and the integration of Knowledge Graphs (KGs) with Large Language Models (LLMs).

This position involves working on cutting-edge technologies to standardize and process data from various Electronic Health Record (EHR) systems, enhance data interoperability, and provide contextual insights for personalized medicine.Key ResponsibilitiesData Mapper Development

Data Standardization And Transformation

Convert diverse data structures from various EHR systems into a unified format based on FHIR standards.

Map and normalize incoming data to the FHIR data model, ensuring consistency and completeness.

Kafka Integration

Consume and process events from the Kafka stream, produced by the Data Writer Module.

Deserialize and validate incoming data to ensure adherence to required standards.

Data Segmentation

Separate data streams for warehousing and AI model training, applying specific preprocessing steps for each purpose.

Prepare and validate data for storage and machine learning model training.

Error Handling And Loggin

Implement robust error handling mechanisms to track and resolve data mapping issues.

Maintain detailed logs for auditing and troubleshooting purposes.Knowledge Graphs and LLM Integration:

Data Ingestion And Processing

Use LLMs to extract structured data from EHRs, research articles, and clinical notes.

Ensure semantic consistency and interoperability during data ingestion.

Knowledge Graph Construction

Integrate extracted data into a knowledge graph, representing entities and relationships for semantic data integration.

Implement contextual understanding and querying of complex relationships within the KG.

Advanced Predictive Modeling

Leverage KGs and LLMs to enhance data interoperability and predictive analytics.

Develop frameworks for contextualized insights and personalized medicine recommendations.

Feedback Loop

Continuously update the knowledge graph with new data using LLMs, ensuring up-to-date and relevant insights.Collaboration and Communication

Work Closely with Cross-Functional Teams

Collaborate with data scientists, AI specialists, and software engineers to design and implement data processing solutions.

Communicate effectively with stakeholders to align on goals and deliverables.

Contribute To Engineering Culture

Foster a culture of innovation, collaboration, and continuous improvement within the engineering team.Qualifications

Experience

Proven Experience as a Data Engineer or Similar Role:

Strong background in data processing, standardization, and integration, particularly in healthcare domains.

Experience With FHIR Standards

Familiarity with implementing FHIR-compliant data models and mapping diverse data structures to FHIR resources.

Expertise In Kafka And Streaming Data

Experience with Kafka or similar streaming platforms for real-time data processing and integration.

Knowledge Graph And LLM Experience

Experience working with knowledge graphs and large language models, particularly in healthcare data contexts.

Skills

Strong Problem-Solving Skills:

Ability to design innovative solutions for complex data integration and processing challenges.

Proficiency In Programming Languages

Strong skills in Python or other relevant programming languages for data engineering.

Database And Querying Skills

Proficiency in SQL and experience with both relational and NoSQL databases.

Excellent Communication Skills

Ability to articulate complex technical concepts and collaborate effectively with various stakeholders.

Technical Expertise

Python:Proficient in Python programming, with experience in data processing and integration.

FHIR and HL7:Familiarity with healthcare standards like FHIR and HL7 for data interoperability.

Kafka:Experience with Kafka for streaming data integration and processing.

Knowledge Graphs:Experience with graph databases like Neo4j or RDF-based systems.

Machine Learning:Familiarity with machine learning models and AI frameworks.

Docker and Kubernetes:Experience with containerization and orchestration tools is a plus.Hiring Details:

Work Type:On-Site

City: Guadalajara, Jalisco, Mexico

Salary Offer: Negotiable

English Level: Native or AdvancedIf you are interested in applying, please send your CV in English ******,mentioning the name of the position you are applying for in the subject of the email.

In the body of the email, please include the following information:

Salary expectations

Availability for interview

Availability to join the team
Publicar un currículum