Data Engineer
HeyDonto AI API
Fecha: hace 1 semana
ciudad: Chihuahua, Chihuahua
Tipo de contrato: Tiempo completo

WE ARE LOOKING FOR DATA ENGINEER AI & DATA¡HeyDonto is seeking a talentedData Engineerto join our team and play a vital role in the development of the HeyDonto Data Mapper and the integration of Knowledge Graphs (KGs) with Large Language Models (LLMs).
This position involves working on cutting-edge technologies to standardize and process data from various Electronic Health Record (EHR) systems, enhance data interoperability, and provide contextual insights for personalized medicine.Key ResponsibilitiesData Mapper DevelopmentData Standardization and Transformation :Convert diverse data structures from various EHR systems into a unified format based on FHIR standards.Map and normalize incoming data to the FHIR data model, ensuring consistency and completeness.Kafka Integration:Consume and process events from the Kafka stream, produced by the Data Writer Module.Deserialize and validate incoming data to ensure adherence to required standards.Data Segmentation:Separate data streams for warehousing and AI model training, applying specific preprocessing steps for each purpose.Prepare and validate data for storage and machine learning model training.Error Handling and Loggin:Implement robust error handling mechanisms to track and resolve data mapping issues.Maintain detailed logs for auditing and troubleshooting purposes.Knowledge Graphs and LLM Integration:Data Ingestion and Processing:Use LLMs to extract structured data from EHRs, research articles, and clinical notes.Ensure semantic consistency and interoperability during data ingestion.Knowledge Graph Construction:Integrate extracted data into a knowledge graph, representing entities and relationships for semantic data integration.Implement contextual understanding and querying of complex relationships within the KG.Advanced Predictive Modeling:Leverage KGs and LLMs to enhance data interoperability and predictive analytics.Develop frameworks for contextualized insights and personalized medicine recommendations.Feedback Loop:Continuously update the knowledge graph with new data using LLMs, ensuring up-to-date and relevant insights.Collaboration and CommunicationWork Closely with Cross-Functional TeamsCollaborate with data scientists, AI specialists, and software engineers to design and implement data processing solutions.Communicate effectively with stakeholders to align on goals and deliverables.Contribute to Engineering Culture:Foster a culture of innovation, collaboration, and continuous improvement within the engineering team.QualificationsExperienceProven Experience as a Data Engineer or Similar Role:Strong background in data processing, standardization, and integration, particularly in healthcare domains.Experience with FHIR Standards:Familiarity with implementing FHIR-compliant data models and mapping diverse data structures to FHIR resources.Expertise in Kafka and Streaming Data:Experience with Kafka or similar streaming platforms for real-time data processing and integration.Knowledge Graph and LLM Experience:Experience working with knowledge graphs and large language models, particularly in healthcare data contexts.SkillsStrong Problem-Solving Skills:Ability to design innovative solutions for complex data integration and processing challenges.Proficiency in Programming Languages:Strong skills in Python or other relevant programming languages for data engineering.Database and Querying Skills:Proficiency in SQL and experience with both relational and NoSQL databases.Excellent Communication Skills:Ability to articulate complex technical concepts and collaborate effectively with various stakeholders.Technical ExpertisePython:Proficient in Python programming, with experience in data processing and integration.FHIR and HL7:Familiarity with healthcare standards like FHIR and HL7 for data interoperability.Kafka:Experience with Kafka for streaming data integration and processing.Knowledge Graphs:Experience with graph databases like Neo4j or RDF-based systems.Machine Learning:Familiarity with machine learning models and AI frameworks.Docker and Kubernetes:Experience with containerization and orchestration tools is a plus.Hiring Details:Work Type:On-SiteCity:Guadalajara, Jalisco, MexicoSalary Offer:NegotiableEnglish Level:Native or AdvancedIf you are interested in applying, please send your CV in English ******,mentioning the name of the position you are applying for in the subject of the email.
In the body of the email, please include the following information:Salary expectationsAvailability for interviewAvailability to join the team
This position involves working on cutting-edge technologies to standardize and process data from various Electronic Health Record (EHR) systems, enhance data interoperability, and provide contextual insights for personalized medicine.Key ResponsibilitiesData Mapper DevelopmentData Standardization and Transformation :Convert diverse data structures from various EHR systems into a unified format based on FHIR standards.Map and normalize incoming data to the FHIR data model, ensuring consistency and completeness.Kafka Integration:Consume and process events from the Kafka stream, produced by the Data Writer Module.Deserialize and validate incoming data to ensure adherence to required standards.Data Segmentation:Separate data streams for warehousing and AI model training, applying specific preprocessing steps for each purpose.Prepare and validate data for storage and machine learning model training.Error Handling and Loggin:Implement robust error handling mechanisms to track and resolve data mapping issues.Maintain detailed logs for auditing and troubleshooting purposes.Knowledge Graphs and LLM Integration:Data Ingestion and Processing:Use LLMs to extract structured data from EHRs, research articles, and clinical notes.Ensure semantic consistency and interoperability during data ingestion.Knowledge Graph Construction:Integrate extracted data into a knowledge graph, representing entities and relationships for semantic data integration.Implement contextual understanding and querying of complex relationships within the KG.Advanced Predictive Modeling:Leverage KGs and LLMs to enhance data interoperability and predictive analytics.Develop frameworks for contextualized insights and personalized medicine recommendations.Feedback Loop:Continuously update the knowledge graph with new data using LLMs, ensuring up-to-date and relevant insights.Collaboration and CommunicationWork Closely with Cross-Functional TeamsCollaborate with data scientists, AI specialists, and software engineers to design and implement data processing solutions.Communicate effectively with stakeholders to align on goals and deliverables.Contribute to Engineering Culture:Foster a culture of innovation, collaboration, and continuous improvement within the engineering team.QualificationsExperienceProven Experience as a Data Engineer or Similar Role:Strong background in data processing, standardization, and integration, particularly in healthcare domains.Experience with FHIR Standards:Familiarity with implementing FHIR-compliant data models and mapping diverse data structures to FHIR resources.Expertise in Kafka and Streaming Data:Experience with Kafka or similar streaming platforms for real-time data processing and integration.Knowledge Graph and LLM Experience:Experience working with knowledge graphs and large language models, particularly in healthcare data contexts.SkillsStrong Problem-Solving Skills:Ability to design innovative solutions for complex data integration and processing challenges.Proficiency in Programming Languages:Strong skills in Python or other relevant programming languages for data engineering.Database and Querying Skills:Proficiency in SQL and experience with both relational and NoSQL databases.Excellent Communication Skills:Ability to articulate complex technical concepts and collaborate effectively with various stakeholders.Technical ExpertisePython:Proficient in Python programming, with experience in data processing and integration.FHIR and HL7:Familiarity with healthcare standards like FHIR and HL7 for data interoperability.Kafka:Experience with Kafka for streaming data integration and processing.Knowledge Graphs:Experience with graph databases like Neo4j or RDF-based systems.Machine Learning:Familiarity with machine learning models and AI frameworks.Docker and Kubernetes:Experience with containerization and orchestration tools is a plus.Hiring Details:Work Type:On-SiteCity:Guadalajara, Jalisco, MexicoSalary Offer:NegotiableEnglish Level:Native or AdvancedIf you are interested in applying, please send your CV in English ******,mentioning the name of the position you are applying for in the subject of the email.
In the body of the email, please include the following information:Salary expectationsAvailability for interviewAvailability to join the team
Ver más empleos en Chihuahua, Chihuahua