- Experience working within the Azure ecosystem, including Azure AI Search, Azure Blob Storage, and Azure Database for PostgreSQL, and an understanding of how to leverage them for data processing, storage, and analytics tasks.
- Experience with techniques such as data normalization, feature engineering, and data augmentation.
- Ability to preprocess and clean large datasets efficiently using Azure tools, Python, and other data manipulation tools.
- Expertise in working with healthcare data standards (e.g., HIPAA and FHIR), sensitive data, and data masking techniques for personally identifiable information (PII) and protected health information (PHI) is essential.
- In-depth knowledge of search algorithms, indexing techniques, and retrieval models for effective information retrieval tasks. Familiarity with search platforms like Elasticsearch or Azure AI Search is a must.
- Familiarity with chunking techniques and working with vectors and vector databases like Pinecone.
- Experience working within the Snowflake ecosystem.
- Ability to design, develop, and maintain scalable data pipelines for ingesting, processing, and transforming large volumes of structured and unstructured data.
- Experience implementing best practices for data storage, retrieval, and access control to ensure data integrity, security, and compliance with regulatory requirements.
- Ability to implement efficient data processing workflows to support the training and evaluation of solutions using large language models, ensuring reliability, scalability, and performance.
- Ability to proactively identify and address issues related to data quality, pipeline failures, or resource contention, ensuring minimal disruption to systems.
- Experience with large language model frameworks, such as LangChain, and knowledge of how to integrate them into data pipelines for natural language processing tasks.