Fall Term Schedule
Only courses with a DSC course number are listed on this page. See MS program for all of the required and elective courses for the degree.
Fall 2026
| Number | Title | Instructor | Time |
|---|
|
DSCC 401-01
Brendan Mort
MW 9:00AM - 10:15AM
|
|
This course provides a hands-on introduction to widely-used tools for data science. Topics include Linux; languages and packages for statistical analysis and visualization; cluster and parallel computing including GPUs; Hadoop and Spark; libraries for machine learning; NoSQL databases; and cloud services. PREREQUISITES: Programming experience strongly recommended.
|
|
DSCC 420-01
Gonzalo Mateos Buckstein
MW 4:50PM - 6:05PM
|
|
The goal of this course is to learn how to model, analyze and simulate stochastic systems, found at the core of a number of disciplines in engineering, for example communication systems, stock options pricing and machine learning. This course is divided into five thematic blocks: Introduction, Probability review, Markov chains, Continuous-time Markov chains, and Gaussian, Markov and stationary random processes.
|
|
DSCC 435-01
Jiaming Liang
TR 9:40AM - 10:55AM
|
|
This course primarily focuses on algorithms for large-scale optimization problems arising in machine learning and data science applications. The first part will cover first-order methods including gradient and subgradient methods, mirror descent, proximal gradient method, accelerated gradient method, Frank-Wolfe method, and inexact proximal point methods. The second part will introduce algorithms for nonconvex optimization, stochastic optimization, distributed optimization, manifold optimization, reinforcement learning, and those beyond first-order.
|
|
DSCC 440-02
Monika Polak
TR 2:00PM - 3:15PM
|
|
Fundamental concepts and techniques of data mining, including data attributes, data visualization, data pre-processing, mining frequent patterns, association and correlation, classification methods, and cluster analysis. Advanced topics include outlier detection, stream mining, and social media data mining. CSC 440, a graduate-level course, requires additional readings and a course project.
|
|
DSCC 461-01
Eustrat Zhupa
TR 12:30PM - 1:45PM
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. Recitation Now Required
|
|
DSCC 461-02
Eustrat Zhupa
T 6:15PM - 7:30PM
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. Recitation Now Required
|
|
DSCC 461-03
Eustrat Zhupa
W 3:25PM - 4:40PM
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. Recitation Now Required
|
|
DSCC 461-04
Eustrat Zhupa
W 4:50PM - 6:05PM
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. Recitation Now Required
|
|
DSCC 461-05
Eustrat Zhupa
R 4:50PM - 6:05PM
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. Recitation Now Required
|
|
DSCC 461-06
Eustrat Zhupa
R 6:15PM - 7:30PM
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. Recitation Now Required
|
|
DSCC 462-01
Anson Kahng
TR 4:50PM - 6:05PM
|
|
This course will cover foundational concepts in descriptive analyses, probability, and statistical inference. Topics to be covered include data exploration through descriptive statistics (with a heavy emphasis on using R for such analyses), elementary probability, diagnostic testing, combinatorics, random variables, elementary distribution theory, statistical inference, and statistical modeling. The inference portion of the course will focus on building and applying hypothesis tests and confidence intervals for population means, proportions, variances, and correlations. Non-parametric alternatives will also be introduced. The modeling portion of the course will include ANOVA, and simple and multiple regression and their respective computational methods. Students will be introduced to the R statistical computing environment.
|
|
DSCC 465-01
Yukun Ma
TR 2:00PM - 3:15PM
|
|
The course provides an introduction to modern machine learning concepts, techniques, and algorithms. Topics discussed include regression, clustering and classification, kernels, support vector machines, feature selection, goodness of fit, neural networks. Programming assignments emphasize taking theory into practice, through applications on real-world data sets. Students will be expected to work with Python programming environment to complete the assignments. PRE-REQUISITES: DSCC 462, STAT 190 or equivalent introductory statistics background.
|
|
DSCC 475-01
Ajay Anand
TR 11:05AM - 12:20PM
|
|
Time series analysis is a valuable data analysis technique in a variety of industrial (e.g., prognostics and health management), business (e.g., financial data analysis) and healthcare (e.g., disease progression modeling) applications. Moreover, forecasting in time series is an essential component of predictive analytics. The course will begin with an introduction to practical aspects relevant to time series data analysis such as data collection, characterization, and preprocessing. Topics covered will include smoothing methods (moving average, exponential smoothing), trend and seasonality in regression models, autocorrelation, AR and ARIMA models applied to time series data. Deep learning models including feedforward, recurrent, gated and convolutional architectures will also be studied. Students shall work on projects with time-series data sets using modeling tools in Python. PREREQUISITES: Introductory statistics; Linear algebra and differential equations;applied Python programming. ECE graduate students may use as elective towards concentrations in: Machine Learning and Artificial Intelligence; Signal and Image Processing and Communications.
|
|
DSCC 481-01
Lisa Pink
M 6:15PM - 8:45PM
|
|
Topics include an introduction to Python programming, data structures, and statistical thinking relevant to healthcare data. The course will also provide a hands-on introduction to: widely used tools, languages and packages used for data science statistical analysis and visualization; creating statistical simulations with Python; libraries for machine learning and deep learning; and cloud services. THIS COURSE IS ONLINE
|
|
DSCC 482-01
Barney Ricca
T 6:15PM - 8:15PM
|
|
The course covers the essentials of the statistical foundations for interpretation and visualization of data. Using statistical tools, gain an understanding of how to interpret data quantitatively and to explore data-oriented structures. The primary focus is on descriptive statistics used to present and summarize numerical information. The course also emphasizes the design of systems for data visualization and related best practices for use in a healthcare setting. Course projects and assignments will involve accessing public healthcare data sources (e.g. MIMIC-III) or a deidentified data repository to extract relevant patient level attributes and create dashboards and visualizations to practice data storytelling for relevant healthcare problems. Students will be introduced to the R programming language which is a mainstay in statistical computing. This course is ONLINE
|
|
DSCC 483-01
Ajay Anand; Cantay Caliskan (Private)
MW 10:25AM - 11:40AM
|
|
The capstone/practicum provides an experience for data science majors/MS candidates to apply the core knowledge and skills attained during their program to a tangible data science focused project. Students will work in small teams on a project that applies data science methods to the analysis of a real-world problem. The instructor will guide each team in developing a topic that makes use of the knowledge the team members gained through their application area courses. The identified projects or problems and data sets will cover a range of application areas and reflect real-world needs from industry, medicine and government. Each student will be required to write a paper about their project, which satisfies one upper-level writing requirement for majors and Plan B for master's. FOR GRADUATING MS Candidate ONLY. PERMISSION REQUEST: To seek instructor permission/eligibility, follow directions on UR Student. https://tech.rochester.edu/wp-content/uploads/QRC-Requesting-Permission-to-Register_UofR-_0200227_cmf.pdf PREREQUISITES: DSCC 240/440 (Data Mining) AND an introductory statistics course such as DSCC 462 or equivalent; DSCC 261/461 (Database Systems) strongly recommended prior but may be taken concurrently.
|
|
DSCC 488-01
Barney Ricca
W 6:15PM - 8:45PM
|
|
This course will focus on deep learning technologies as applied to healthcare. The course will start with the basics of deep learning, then move to the use of using CNNs to process a variety of medical images for tasks such as classification, regression, and segmentation. Applications of deep learning for NLP will cover how to apply deep learning to clinical notes, radiology reports, etc., for text summarization, context review, and other inference tasks. The project exercises will also introduce students to text mining concepts applied to healthcare data. Emerging deep learning models including transformers, LLMs and an introduction to Generative AI will also be presented. PREREQUISITES: Python Programming and Tools for Data Science THIS COURSE IS ONLINE.
|
|
DSCC 489-01
Barney Ricca
R 6:15PM - 8:55PM
|
|
This is a culminating experiential learning course where students will work in teams or individually on real-world data analytic challenges; Each student will prepare a final presentation and report documenting their work which will also serve as the culminating exam for the program; Students will work with real-world clinical data and implementable solutions for delivery to actual or potential stakeholders. The course will also introduce students to the process of how to structure a clinical question as a data science/machine learning problem, data acquisition/provenance, insight generation, and enable them to practice effective communication and storytelling to diverse stakeholders encountered in healthcare settings. Students will have the opportunity to work with datasets and problems originating from various areas within healthcare including bioinformatics/genomics, medical imaging, physiological signals and sensor data. Students can work with the course instructor to identify suitable project opportunities in collaboration with URMC clinical departments, research groups such as UR Health Lab and/or from their professional network/current place of work. PREREQUISITES: DSCC 481 Python Programming and Tools for Data Science AND DSCC 482 Statistical Foundations and Data Visualization THIS COURSE IS ONLINE
|
|
DSCC 491-01
7:00PM - 7:00PM
|
|
This course is for master's students that have made arrangements with a faculty member to complete readings and discussion in a particular subject in their field of study.
|
|
DSCC 491-02
Heather Reyes
7:00PM - 7:00PM
|
|
This course is for master's students that have made arrangements with a faculty member to complete readings and discussion in a particular subject in their field of study. The Digital Health Innovation (DHI) Program is an elective experience designed for those looking to understand collaborative innovation in digital healthcare. It is currently supported by the Department of Pediatrics and the UR Health Lab, and includes learners with nursing, physician, and data science backgrounds. The DHI has two phases. The first involves two weeks of interactive sessions focused on three questions: 1) What is Digital Health? 2) How does Digital Health Exist in Academia? 3) How Can One Become an Entrepreneur? Sessions are presented in a variety of formats by expert faculty from across the medical center, university, and industry partners (i.e., anesthesiology, pediatrics, psychiatry, nursing, data & computer science, business, ethics). Covered topics are displayed in the below figure. This experience is followed by a longitudinal mentored project in one area of interest. The program allows for projects to take one of several forms: traditional research project, clinically focused quality improvement, or entrepreneurial focused proof of concept. Technical and clinical mentors are paired with learners during the first 2-4 months of the project based on their skills and interest. This year’s program will be September 21st- October 2nd. Interested graduate students should get a signed departmental research contract with Heather Reyes prior to registering for the course.
|
|
DSCC 495-01
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 495-02
Dillon Dzikowicz
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 495-03
Heather Reyes
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 495-04
Ram Haddas
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit. TITLE: URMC Motion Labs Description: * Recognize common human motion laboratory tools (i.e human motion capture, force plate, EMG, etc) and the types of data that are output from those devices. * Become knowledgeable of how human motion laboratory data is processed and analyzed for clinical and research purposes. * Understand the process of combining laboratory data with patient clinical outcomes (i.e. clinical standard of care, questionnaires, radiographic measurements, psychological factors, and medicate intake) into a large collective database * Experience recalling subsets of data from the database for use by laboratory personnel and UR Orthopaedic clinical staff. * Provide and prepare a subset of data statistical analysis Evaluation based on: * Development of a large-scale database to store all of the clinical and research data collected from patients/subjects in the human motion laboratory and clinical standard of care. * Development of an efficient process to recall a subset of the database to be used in clinical reporting and research purposes. * Development of an efficient process to store and recall control patient/subject data to be used in clinical reporting and publication purposes.
|
|
DSCC 495-05
Gonzalo Mateos Buckstein
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 495-06
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 495-07
Dongmei Li
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit. PROJECT TITLE: Association of smoking with commorbidity and COVID-19 severity PROJECT DESCRIPTION:Recent studies have shown COVID-19 patients with comorbidities have an increased risk for severe illness of COVID-19. Numerous studies have showed the link between smoking and comorbidities as smoking is a well-known risk factor for common comorbidities. However, the association of smoking and severity of COVID-19 is still unclear with inclusive results from recent studies. Given the association of comorbidities with both smoking and COVID-19, we propose to investigate the moderation effects of smoking in the association of comorbidities and COVID-19 using de-identified data (level 2) from the National COVID Cohort Collaborative (N3C). Our research question is whether smoking has moderation effects in the association of comorbidities with COVID-19 outcomes such as whether a patient is hospitalized, admitted to the ICU, and dead due to COVID-19. The proposed study will contribute to the literature on the direct and indirect association of both smoking/vaping and comorbidities with COVID-19 outcomes. Further we also plan to determine the health disparity relationship with smoking/vaping in COVID-19. EVALUATION: The student will read related papers and meet with the mentor weekly through Zoom. We expect to generate a manuscript from the study results. (1credit)
|
|
DSCC 495-08
Sukardi Suba
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 495-09
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 495-10
Tolulope Olugboji
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 495-18
Mujdat Cetin
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 495-19
Florian Jaeger
7:00PM - 7:00PM
|
|
This course provides master’s students with the opportunity to conduct, develop, and refine their research projects. Students will engage in research relevant to their field of study and make progress toward completing their degrees. Contact program coordinator and faculty before registering research for credit.
|
|
DSCC 511-02
Hangfeng He
MW 9:00AM - 10:15AM
|
|
This seminar offers an introduction to Large Language Models (LLMs), covering essential concepts such as Transformers, BERT, GPT-3, InstructGPT, prompting & decoding, and emergent abilities. Students will engage with a range of topics through paper presentations on themes such as Tool-Augmented LLMs, Multimodal Learning, LLMs for Science, Social and Ethical Concerns, Superintelligence Concerns, and Democratizing LLMs. Participants are required to present and discuss papers, write critical literature reviews, reproduce paper results, and collaborate on team projects. This seminar aims to provide a thorough understanding of LLMs, exploring their origins, opportunities, and concerns to enhance professional expertise in the field. Prerequisites: Students have completed at least one course in Natural Language Processing (NLP), Deep Learning (DL), or Machine Learning (ML), such as CSC 2/447 (Natural Language Processing), CSC 2/445 (Deep Learning), CSC 2/466 (Frontiers in Deep Learning), CSC 2/477 (End-to-End Deep Learning), and CSC 2/446 (Machine Learning). Other relevant courses in NLP, DL, or ML are also acceptable.
|
|
DSCC 895-01
7:00PM - 7:00PM
|
|
This course is designed for master's degree students who have completed all required coursework but still need to finalize specific degree requirements under less than half-time enrollment.
|
|
DSCC 897-01
Ajay Anand
7:00PM - 7:00PM
|
|
This course provides master's students who are currently completing their final required coursework, or with special circumstances like an approved reduced courseload, with the opportunity to work full-time on their degrees. Students will make significant progress toward completing their degrees.
|
|
DSCC 897A-01
7:00PM - 7:00PM
|
|
This course provides master's students who are currently completing their final required coursework, or with special circumstances like an approved reduced courseload, with the opportunity to work full-time on their degrees while studying in absentia. Students will make significant progress toward completing their degrees.
|
|
DSCC 899-01
7:00PM - 7:00PM
|
|
This course provides master’s students who have completed or are currently completing all course requirements with the opportunity to work full-time on their thesis. Students will make significant progress toward completing their degrees.
|
Fall 2026
| Number | Title | Instructor | Time |
|---|---|
| Monday | |
|
DSCC 481-01
Lisa Pink
|
|
|
Topics include an introduction to Python programming, data structures, and statistical thinking relevant to healthcare data. The course will also provide a hands-on introduction to: widely used tools, languages and packages used for data science statistical analysis and visualization; creating statistical simulations with Python; libraries for machine learning and deep learning; and cloud services. |
|
| Monday and Wednesday | |
|
DSCC 401-01
Brendan Mort
|
|
|
This course provides a hands-on introduction to widely-used tools for data science. Topics include Linux; languages and packages for statistical analysis and visualization; cluster and parallel computing including GPUs; Hadoop and Spark; libraries for machine learning; NoSQL databases; and cloud services. |
|
|
DSCC 511-02
Hangfeng He
|
|
|
This seminar offers an introduction to Large Language Models (LLMs), covering essential concepts such as Transformers, BERT, GPT-3, InstructGPT, prompting & decoding, and emergent abilities. Students will engage with a range of topics through paper presentations on themes such as Tool-Augmented LLMs, Multimodal Learning, LLMs for Science, Social and Ethical Concerns, Superintelligence Concerns, and Democratizing LLMs. Participants are required to present and discuss papers, write critical literature reviews, reproduce paper results, and collaborate on team projects. This seminar aims to provide a thorough understanding of LLMs, exploring their origins, opportunities, and concerns to enhance professional expertise in the field. |
|
|
DSCC 483-01
Ajay Anand; Cantay Caliskan (Private)
|
|
|
The capstone/practicum provides an experience for data science majors/MS candidates to apply the core knowledge and skills attained during their program to a tangible data science focused project. Students will work in small teams on a project that applies data science methods to the analysis of a real-world problem. The instructor will guide each team in developing a topic that makes use of the knowledge the team members gained through their application area courses. The identified projects or problems and data sets will cover a range of application areas and reflect real-world needs from industry, medicine and government. Each student will be required to write a paper about their project, which satisfies one upper-level writing requirement for majors and Plan B for master's. FOR GRADUATING MS Candidate ONLY. PERMISSION REQUEST: To seek instructor permission/eligibility, follow directions on UR Student. https://tech.rochester.edu/wp-content/uploads/QRC-Requesting-Permission-to-Register_UofR-_0200227_cmf.pdf |
|
|
DSCC 420-01
Gonzalo Mateos Buckstein
|
|
|
The goal of this course is to learn how to model, analyze and simulate stochastic systems, found at the core of a number of disciplines in engineering, for example communication systems, stock options pricing and machine learning. This course is divided into five thematic blocks: Introduction, Probability review, Markov chains, Continuous-time Markov chains, and Gaussian, Markov and stationary random processes. |
|
| Tuesday | |
|
DSCC 461-02
Eustrat Zhupa
|
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. |
|
|
DSCC 482-01
Barney Ricca
|
|
|
The course covers the essentials of the statistical foundations for interpretation and visualization of data. Using statistical tools, gain an understanding of how to interpret data quantitatively and to explore data-oriented structures. The primary focus is on descriptive statistics used to present and summarize numerical information. The course also emphasizes the design of systems for data visualization and related best practices for use in a healthcare setting. Course projects and assignments will involve accessing public healthcare data sources (e.g. MIMIC-III) or a deidentified data repository to extract relevant patient level attributes and create dashboards and visualizations to practice data storytelling for relevant healthcare problems. Students will be introduced to the R programming language which is a mainstay in statistical computing. |
|
| Tuesday and Thursday | |
|
DSCC 435-01
Jiaming Liang
|
|
|
This course primarily focuses on algorithms for large-scale optimization problems arising in machine learning and data science applications. The first part will cover first-order methods including gradient and subgradient methods, mirror descent, proximal gradient method, accelerated gradient method, Frank-Wolfe method, and inexact proximal point methods. The second part will introduce algorithms for nonconvex optimization, stochastic optimization, distributed optimization, manifold optimization, reinforcement learning, and those beyond first-order. |
|
|
DSCC 475-01
Ajay Anand
|
|
|
Time series analysis is a valuable data analysis technique in a variety of industrial (e.g., prognostics and health management), business (e.g., financial data analysis) and healthcare (e.g., disease progression modeling) applications. Moreover, forecasting in time series is an essential component of predictive analytics. The course will begin with an introduction to practical aspects relevant to time series data analysis such as data collection, characterization, and preprocessing. Topics covered will include smoothing methods (moving average, exponential smoothing), trend and seasonality in regression models, autocorrelation, AR and ARIMA models applied to time series data. Deep learning models including feedforward, recurrent, gated and convolutional architectures will also be studied. Students shall work on projects with time-series data sets using modeling tools in Python. |
|
|
DSCC 461-01
Eustrat Zhupa
|
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. |
|
|
DSCC 440-02
Monika Polak
|
|
|
Fundamental concepts and techniques of data mining, including data attributes, data visualization, data pre-processing, mining frequent patterns, association and correlation, classification methods, and cluster analysis. Advanced topics include outlier detection, stream mining, and social media data mining. CSC 440, a graduate-level course, requires additional readings and a course project. |
|
|
DSCC 465-01
Yukun Ma
|
|
|
The course provides an introduction to modern machine learning concepts, techniques, and algorithms. Topics discussed include regression, clustering and classification, kernels, support vector machines, feature selection, goodness of fit, neural networks. Programming assignments emphasize taking theory into practice, through applications on real-world data sets. Students will be expected to work with Python programming environment to complete the assignments. |
|
|
DSCC 462-01
Anson Kahng
|
|
|
This course will cover foundational concepts in descriptive analyses, probability, and statistical inference. Topics to be covered include data exploration through descriptive statistics (with a heavy emphasis on using R for such analyses), elementary probability, diagnostic testing, combinatorics, random variables, elementary distribution theory, statistical inference, and statistical modeling. The inference portion of the course will focus on building and applying hypothesis tests and confidence intervals for population means, proportions, variances, and correlations. Non-parametric alternatives will also be introduced. The modeling portion of the course will include ANOVA, and simple and multiple regression and their respective computational methods. Students will be introduced to the R statistical computing environment. |
|
| Wednesday | |
|
DSCC 461-03
Eustrat Zhupa
|
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. |
|
|
DSCC 461-04
Eustrat Zhupa
|
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. |
|
|
DSCC 488-01
Barney Ricca
|
|
|
This course will focus on deep learning technologies as applied to healthcare. The course will start with the basics of deep learning, then move to the use of using CNNs to process a variety of medical images for tasks such as classification, regression, and segmentation. Applications of deep learning for NLP will cover how to apply deep learning to clinical notes, radiology reports, etc., for text summarization, context review, and other inference tasks. The project exercises will also introduce students to text mining concepts applied to healthcare data. Emerging deep learning models including transformers, LLMs and an introduction to Generative AI will also be presented. |
|
| Thursday | |
|
DSCC 461-05
Eustrat Zhupa
|
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. |
|
|
DSCC 461-06
Eustrat Zhupa
|
|
|
This course presents the fundamental concepts of database design and use. It provides a study of data models, data description languages, and query facilities including relational algebra and SQL, data normalization, transactions and their properties, physical data organization and indexing, security issues and object databases. It also looks at the new trends in databases. The knowledge of the above topics will be applied in the design and implementation of a database application using a target database management system as part of a semester-long group project. |
|
|
DSCC 489-01
Barney Ricca
|
|
|
This is a culminating experiential learning course where students will work in teams or individually on real-world data analytic challenges; Each student will prepare a final presentation and report documenting their work which will also serve as the culminating exam for the program; Students will work with real-world clinical data and implementable solutions for delivery to actual or potential stakeholders. The course will also introduce students to the process of how to structure a clinical question as a data science/machine learning problem, data acquisition/provenance, insight generation, and enable them to practice effective communication and storytelling to diverse stakeholders encountered in healthcare settings. Students will have the opportunity to work with datasets and problems originating from various areas within healthcare including bioinformatics/genomics, medical imaging, physiological signals and sensor data. Students can work with the course instructor to identify suitable project opportunities in collaboration with URMC clinical departments, research groups such as UR Health Lab and/or from their professional network/current place of work. |
|
| Friday | |