Cigna Healthcare is dedicated to improving health and vitality, and they are seeking Summer Interns for their Data Science Internship program. As an intern, you will apply advanced analytics and data science techniques to tackle healthcare challenges and work on projects that enhance revenue and member experience.
Use data science techniques that include predictive modeling, machine learning, text mining, segmentation and clustering to help in developing solutions to increase revenue and improve Member Experience.
Utilize Machine Learning for developing next generation of predictive models for targeted business solutions.
Support the team in application of Text Mining to mine Voice of Customer data.
Develop supervised / unsupervised algorithms, Big Data pipelines for making business processes efficient.
Develop Next Generation Supervised Machine Learning Clinical models to predict Medical Non-Adherence.
Training of call classification unsupervised topic models using big data platforms such as Spark and Hadoop.
Leverage state of the art NLP Algorithms for detection of personal health information (PHI) entities within free text comments to automatically de-identify natural language.
Modularize the entity-based feature engineering code base on Hadoop to leverage big data technologies such as Pyspark and Spark SQL.
Qualification
Required
Working towards a Master’s degrees or PhD in quantitative disciplines such as Data Science, Statistics, Applied Mathematics, Computer Science, Bioinformatics, Computational Linguistics or related quantitative disciplines
Excellent ability to query large datasets using ANSI SQL/HIVE SQL and working with relational databases
Proficient programming skills either in SAS, R, or Python
Proficient in Basic Probability and Statistics
Proficient with one or more quantitative data analysis languages such as R, Python
Demonstrated coursework / real world application of analytical methods such as Regression, Naïve-Bayes, Decision trees, experimental designs, support vector machines, machine learning and text mining, Natural Language Processing
Proficiency with relational database concepts and SQL preferred
Proficiency in data manipulation, cleansing and interpretation
Experience working in distributed computing and Big Data Technologies like Hive, Spark, Scala, HDFS
Experience with visualization tools like Tableau, Shiny, ggplot, Matplotlib
Experience with Microsoft Office Suite
Preferred
Benefits
We are a health benefits provider that advocates for better health through every stage of life.