Scientist serp_jobs.h1.location_city
serp_jobs.job_alerts.create_a_job
Scientist • indianapolis in
Data Scientist
CData SoftwareIndianapolis, IN, United StatesData Scientist, Privacy
DatavantIndianapolis, IN, United StatesData Scientist 4
OracleIndianapolis, IN, United StatesScientist LCMS
IQVIAIndianapolis, Indiana, USAPrincipal Quantitative Scientist
RocheIndianapolis, Indiana, USA- serp_jobs.job_card.promoted
Medical Laboratory Scientist
Pride HealthIndianapolis, IN, USScientist / Sr Scientist - Oligonucleotide Chemistry
Eli Lilly and CompanyIndianapolis, Indiana, United Statesdata Scientist
innovitusaIndianapolis, Indiana, USAData Scientist II
CoinbaseIndianapolis, IN, United StatesMedical Lab Scientist
CYNET SystemsIndianapolis, IN, US- serp_jobs.job_card.promoted
Data Scientist
Diverse LynxIndianapolis, IN, United StatesProject Geologist / Scientist
StantecIndianapolis, IN, USData Scientist
Elanco Animal Health IncorporatedIndianapolis, IN, United States- serp_jobs.job_card.promoted
Lead Data Scientist
HumanaIndianapolis, IN, United States- serp_jobs.job_card.promoted
Senior Data Scientist
Cardinal HealthIndianapolis, IN, United States- serp_jobs.job_card.promoted
Data Scientist
METAIndianapolis, IN, United States- serp_jobs.job_card.promoted
Data Scientist, Marketing
ConfluentIndianapolis, IN, United States- serp_jobs.job_card.promoted
Junior Data Scientist
SynergisticITIndianapolis, IN, United States- serp_jobs.job_card.promoted
Data Scientist
Vimerse InfoTech IncIndianapolis, IN, United StatesData Scientist
CData SoftwareIndianapolis, IN, United States- serp_jobs.job_card.full_time
Role name : Data Scientist
Role Description :
1. Create and set best practices for data ingestion, integration, and access patterns to support both real-time and batch-based consumer data needs
2. Assist with design and lead development on scalable, high-performance data architecture solutions that supports both the consumer side of the business as well as analytic use cases
3. Create comprehensive documentation for design, and processes to support ongoing maintenance and knowledge sharing for both GMP and non-GMP solutions.
4. Drive continuous data transformation to minimize technical debt
5. Responsible for creation of test protocols / test scripts and other validation deliverables.
6. Provide technical support to local end users on Data pipelines and Advanced Analytics Solutions developed
Competencies :
Digital : Python, Digital : Apache Spark, Digital : Kafka
Experience (Years) : 6-8
Essential Skills :
- ? Demonstrated experience in designing and implementing complex data systems from the ground up.
- ? Strong experience with programming languages, such as Python, SQL & Spark
- ? Experience with building batch and streaming pipelines using complex SQL, PySpark, Pandas, and similar frameworks
- ? Develop, refine, and optimize Advanced Analytics Solutions using machine learning models to extract insights from complex data sources.
- ? Transform data using SQL, NoSQL, and Python. Visualizing data using a diverse tool set including but not limited to Python and R.
- ? Experience with cloud services in AWS and / or Microsoft Azure? Experience with message brokers and event-driven architectures (e.g, MQTT, Kafka, RabbitMQ)
- ? Experience in handling data streams, APIs, events, container orchestration products such as OpenShift, EKS, ECS.
- ? Experience testing, troubleshooting & establishing API connectivity utilizing software documentation and tools such as Postman
- ? Strong experience transforming data using common ETL / ELT patterns
- ? Experience with orchestrating complex workflows and data pipelines using like Airflow or similar tools
- ? Knowledge and / or experience in predictive modeling and machine learning is a plus.
- ? Manufacturing Pharma experience is a plus
Location : Indianapolis, IN
Keywords :
Python, SQL, PySpark, Pandas, PySpark, Pandas,OpenShift, EKS, ECS,Databricks