Data Engineer [Ai & Data]
HeyDonto AI API
Fecha: hace 2 semanas
ciudad: Chihuahua, Chihuahua
Tipo de contrato: Tiempo completo

WE ARE LOOKING FOR DATA ENGINEER AI & DATA¡HeyDonto is seeking a talentedData Engineerto join our team and play a vital role in the development of the HeyDonto Data Mapper and the integration of Knowledge Graphs (KGs) with Large Language Models (LLMs).
This position involves working on cutting-edge technologies to standardize and process data from various Electronic Health Record (EHR) systems, enhance data interoperability, and provide contextual insights for personalized medicine.Key ResponsibilitiesData Mapper Development
Data Standardization And Transformation
Convert diverse data structures from various EHR systems into a unified format based on FHIR standards.
Map and normalize incoming data to the FHIR data model, ensuring consistency and completeness.
Kafka Integration
Consume and process events from the Kafka stream, produced by the Data Writer Module.
Deserialize and validate incoming data to ensure adherence to required standards.
Data Segmentation
Separate data streams for warehousing and AI model training, applying specific preprocessing steps for each purpose.
Prepare and validate data for storage and machine learning model training.
Error Handling And Loggin
Implement robust error handling mechanisms to track and resolve data mapping issues.
Maintain detailed logs for auditing and troubleshooting purposes.Knowledge Graphs and LLM Integration:
Data Ingestion And Processing
Use LLMs to extract structured data from EHRs, research articles, and clinical notes.
Ensure semantic consistency and interoperability during data ingestion.
Knowledge Graph Construction
Integrate extracted data into a knowledge graph, representing entities and relationships for semantic data integration.
Implement contextual understanding and querying of complex relationships within the KG.
Advanced Predictive Modeling
Leverage KGs and LLMs to enhance data interoperability and predictive analytics.
Develop frameworks for contextualized insights and personalized medicine recommendations.
Feedback Loop
Continuously update the knowledge graph with new data using LLMs, ensuring up-to-date and relevant insights.Collaboration and Communication
Work Closely with Cross-Functional Teams
Collaborate with data scientists, AI specialists, and software engineers to design and implement data processing solutions.
Communicate effectively with stakeholders to align on goals and deliverables.
Contribute To Engineering Culture
Foster a culture of innovation, collaboration, and continuous improvement within the engineering team.Qualifications
Experience
Proven Experience as a Data Engineer or Similar Role:
Strong background in data processing, standardization, and integration, particularly in healthcare domains.
Experience With FHIR Standards
Familiarity with implementing FHIR-compliant data models and mapping diverse data structures to FHIR resources.
Expertise In Kafka And Streaming Data
Experience with Kafka or similar streaming platforms for real-time data processing and integration.
Knowledge Graph And LLM Experience
Experience working with knowledge graphs and large language models, particularly in healthcare data contexts.
Skills
Strong Problem-Solving Skills:
Ability to design innovative solutions for complex data integration and processing challenges.
Proficiency In Programming Languages
Strong skills in Python or other relevant programming languages for data engineering.
Database And Querying Skills
Proficiency in SQL and experience with both relational and NoSQL databases.
Excellent Communication Skills
Ability to articulate complex technical concepts and collaborate effectively with various stakeholders.
Technical Expertise
Python:Proficient in Python programming, with experience in data processing and integration.
FHIR and HL7:Familiarity with healthcare standards like FHIR and HL7 for data interoperability.
Kafka:Experience with Kafka for streaming data integration and processing.
Knowledge Graphs:Experience with graph databases like Neo4j or RDF-based systems.
Machine Learning:Familiarity with machine learning models and AI frameworks.
Docker and Kubernetes:Experience with containerization and orchestration tools is a plus.Hiring Details:
Work Type:On-Site
City: Guadalajara, Jalisco, Mexico
Salary Offer: Negotiable
English Level: Native or AdvancedIf you are interested in applying, please send your CV in English ******,mentioning the name of the position you are applying for in the subject of the email.
In the body of the email, please include the following information:
Salary expectations
Availability for interview
Availability to join the team
This position involves working on cutting-edge technologies to standardize and process data from various Electronic Health Record (EHR) systems, enhance data interoperability, and provide contextual insights for personalized medicine.Key ResponsibilitiesData Mapper Development
Data Standardization And Transformation
Convert diverse data structures from various EHR systems into a unified format based on FHIR standards.
Map and normalize incoming data to the FHIR data model, ensuring consistency and completeness.
Kafka Integration
Consume and process events from the Kafka stream, produced by the Data Writer Module.
Deserialize and validate incoming data to ensure adherence to required standards.
Data Segmentation
Separate data streams for warehousing and AI model training, applying specific preprocessing steps for each purpose.
Prepare and validate data for storage and machine learning model training.
Error Handling And Loggin
Implement robust error handling mechanisms to track and resolve data mapping issues.
Maintain detailed logs for auditing and troubleshooting purposes.Knowledge Graphs and LLM Integration:
Data Ingestion And Processing
Use LLMs to extract structured data from EHRs, research articles, and clinical notes.
Ensure semantic consistency and interoperability during data ingestion.
Knowledge Graph Construction
Integrate extracted data into a knowledge graph, representing entities and relationships for semantic data integration.
Implement contextual understanding and querying of complex relationships within the KG.
Advanced Predictive Modeling
Leverage KGs and LLMs to enhance data interoperability and predictive analytics.
Develop frameworks for contextualized insights and personalized medicine recommendations.
Feedback Loop
Continuously update the knowledge graph with new data using LLMs, ensuring up-to-date and relevant insights.Collaboration and Communication
Work Closely with Cross-Functional Teams
Collaborate with data scientists, AI specialists, and software engineers to design and implement data processing solutions.
Communicate effectively with stakeholders to align on goals and deliverables.
Contribute To Engineering Culture
Foster a culture of innovation, collaboration, and continuous improvement within the engineering team.Qualifications
Experience
Proven Experience as a Data Engineer or Similar Role:
Strong background in data processing, standardization, and integration, particularly in healthcare domains.
Experience With FHIR Standards
Familiarity with implementing FHIR-compliant data models and mapping diverse data structures to FHIR resources.
Expertise In Kafka And Streaming Data
Experience with Kafka or similar streaming platforms for real-time data processing and integration.
Knowledge Graph And LLM Experience
Experience working with knowledge graphs and large language models, particularly in healthcare data contexts.
Skills
Strong Problem-Solving Skills:
Ability to design innovative solutions for complex data integration and processing challenges.
Proficiency In Programming Languages
Strong skills in Python or other relevant programming languages for data engineering.
Database And Querying Skills
Proficiency in SQL and experience with both relational and NoSQL databases.
Excellent Communication Skills
Ability to articulate complex technical concepts and collaborate effectively with various stakeholders.
Technical Expertise
Python:Proficient in Python programming, with experience in data processing and integration.
FHIR and HL7:Familiarity with healthcare standards like FHIR and HL7 for data interoperability.
Kafka:Experience with Kafka for streaming data integration and processing.
Knowledge Graphs:Experience with graph databases like Neo4j or RDF-based systems.
Machine Learning:Familiarity with machine learning models and AI frameworks.
Docker and Kubernetes:Experience with containerization and orchestration tools is a plus.Hiring Details:
Work Type:On-Site
City: Guadalajara, Jalisco, Mexico
Salary Offer: Negotiable
English Level: Native or AdvancedIf you are interested in applying, please send your CV in English ******,mentioning the name of the position you are applying for in the subject of the email.
In the body of the email, please include the following information:
Salary expectations
Availability for interview
Availability to join the team
Ver más empleos en Chihuahua, Chihuahua