Anar al contingut (clic a Intro)
UdG Home UdG Home
Tancar
Menú

Estudia

General information

Academic year:
2024
Description:
The purpose of this course is to revise and understand the statistical basis of data science and to introduce some specific techniques that support data analysis and modelling in this context. 1. Basic data exploration and visualisation. Computational tools. 2. Concepts of probability and statistics. 3. Concepts and techniques of multivariate data analysis. 4. Statistical modelling: linear, non-linear and generalised linear models. 5. Model assessment: goodness of fit, predictive power, cross-validation.
Academic credits:
6
Course coordinator:
Javier Palarea Albaladejo

Groups

Group A

Duration:
One-semester, 1st semester
Teaching staff:
Language of the classes:
English (100%)

Competences

  • CG3- Communicate in an effective way both orally and in writing, preparing documents and presenting projects and results with English language
  • CG4- Designing creative proposals
  • CB10- That students have the learning skills to allow them to continue studying in a way that will mostly be self-directed or autonomous
  • CE1- Programming, at an advanced level, in the languages and libraries most used in intelligent field robotics
  • CE2- Analyse a problem related to intelligent autonomous systems and identify the appropriate techniques and tools to solve it
  • CE3- Understand, develop, modify and effectively apply machine learning methods.
  • CE8- Understand the mathematical foundations of intelligent robotic system algorithms

Syllabus

1. Overview of basic statistics and probability

          1.1. Getting started with R

          1.2. Basic concepts and data exploration

          1.3. Probability distributions

          1.4. Sampling, estimation and hypothesis testing

2. Introduction to multivariate data analysis

          2.1. Multivariate data

          2.2. Data reduction: principal components analysis and biplot

          2.3. Supervised classification: discriminant analysis

          2.4. Resampling and cross-validation

          2.5. Correspondence analysis of count data

          2.6. Low-dimensional visualisation: multidimensional scaling

3. Statistical modelling

          3.1. Linear and generalised linear regression

          3.2. Logistic regression for binary response

          3.3. Poisson regression for counts

          3.4. Additive models based on smooth splines

          3.5. Model assessment and simplification

          3.6. Regression analysis with many variables

Activities

Activity type Hours with a teacher Hours without a teacher Virtual hours with a teacher Total
Solution of exercises 12,00 42,00 0 54,00
Theory class 24,00 30,00 0 54,00
Hands-on class 18,00 24,00 0 42,00
Total 54,00 96,00 0 150

Bibliography

  • Faraway, J.J. (2016). Extending the Linear Model with R: Generalized Linear, Mixed Effects and Nonparametric Regression Models (Segona). Chapman and Hall/CRC.
  • Hastie, T., Tibshirani, R. and Friedman, J.H. (2016). The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Segona). Springer.
  • James, G., Witten, D., Hastie, T. and Tibshirani, R. (2021). An Introduction to Statistical Learning with Applications in R (Segona). Springer.
  • Johnson, R.A., Wichern, D.W. (2007). Applied Multivariate Statistical Analysis (Sisena). Pearson Prentice Hall.
  • Schumacker, Randall E. (2016). Using R with multivariate statistics. Thousand Oaks, California: SAGE Publications, Inc.. Catàleg

Assessment and Grading

Assessment activities:

Description of the activity Assessment Activity % Remediable subject
Tasks and tests of application of statistical thinking, concepts and methods Understanding of concepts and use of statistical methods and models to solve real-world problems. Correct use of methods, computations and scientific software tools. 100 No

Grading

Assessment tests conducted during the course will be assessed out of 10 total marks each.

Specific criteria for the "No show" grade:
Not making any of the assessment tests.

Single Assessment:
To be discussed with the teacher at the start of the course. It would involve to pass a test in relation to the contents of the course.

Minimum requirements to pass:
A pass requires obtaining at least 5 marks, averaged over all tests conducted.

Mentorship

On request.

Communication and interaction with students

All communications and notices will be posted on the Moodle's noticeboard for the course.

The teacher can be reached in person during the sessions or by e-mail.

Escull quins tipus de galetes acceptes que el web de la Universitat de Girona pugui guardar en el teu navegador.

Les imprescindibles per facilitar la vostra connexió. No hi ha opció d'inhabilitar-les, atès que són les necessàries pel funcionament del lloc web.

Permeten recordar les vostres opcions (per exemple llengua o regió des de la qual accediu), per tal de proporcionar-vos serveis avançats.

Proporcionen informació estadística i permeten millorar els serveis. Utilitzem cookies de Google Analytics que podeu desactivar instal·lant-vos aquest plugin.

Per a oferir continguts publicitaris relacionats amb els interessos de l'usuari, bé directament, bé per mitjà de tercers (“adservers”). Cal activar-les si vols veure els vídeos de Youtube incrustats en el web de la Universitat de Girona.