Advanced Courses in Life Sciences

Online Course -1st Edition

Introduction to Machine Learning applied to Palaeontology and Archaeology

September 7th-11th, 2020

Palaeontology and Archaeology

Registration

REGISTRATION IS CLOSED

Please, SUBSCRIBE to our Newsletter if you want to receive information on new editions.

Course overview

This course introduces students to the most advanced tools in Artificial Intelligence (AI); machine learning methods that make data mining and data processing a fascinating topic.

Obtaining and analyzing data is currently a very well developed field in computer science. Finding patterns in these data, or processing this information, is less straightforward and is sometimes subjected to biases. Data Mining has recently given way to Process Mining, in which powerful statistical and software tools are used in combination to correctly detect patterns and make reliable classifications of customers or products and make accurate predictions. For Paleobiology, these tools provide the most advanced computing technique for accurate classification and prediction.

This course offers a practical introduction to Machine Learning applied to Palaeontology and Archaeology. From class One, students will learn the use of these information-managing tools on their computers. After its completion, students will be prepared to understand the patterns hidden in any database, regardless of its size and complexity. For a practical demonstration, two types of taphonomic fields will be provided.

The study of bone surface modifications (BSM) has been one of the most difficult and controversial areas in taphonomic research. Only AI has provided a way to understands the subtleties of this type of analysis by yielding systematic identification rates of BSM with accuracy higher than 90% of the cases. This constitutes a major revolution in this field.

The second taphonomic field is biometric. As a practicum, metric properties of broken bones will be used to discern process (dry and green breaking) and agency (human or carnivore) in bone fragmentation.

Teaching will be done using R. In the last module involving computer vision and deep learning, both R and Python will be used.

Requirements

Basic knowledge of R is strongly recommended. If you are not familiar with R, you can learn it using the package Swirl.

Although students will benefit from having prior knowledge on statistics (namely, univariate and bivariate or multivariate statistics), the teaching system will not require them to have any statistical basis. Concepts will be explained from their basic foundation so that they are fully understood by students with different backgrounds.

All participants must have a personal computer (Windows, Macintosh), with webcam if possible, and a good internet connection.

Contact

courses@transmittingscience.com

LOCATION

This course will be delivered online.

Please check the schedule for the live online part, and be aware that it is GMT+1.

DATE

September 7th-11th, 2020

LANGUAGE

English

COURSE LENGTH & ECTS

30 hours online

This course is equivalent to 1 ECTS (European Credit Transfer System) at the Life Science Zurich Graduate School.

The recognition of ECTS by other institutions depends on each university or school.

PLACES

Places are limited to 15 participants and will be occupied by strict registration order. If the course fills up there will be an assistant instructor to help during the practise time.

Participants who have completed the course will receive a certificate at the end of it.

Instructor

Dr. Manuel Domínguez-Rodrigo
Complutense University
Spain

Coordinators

Ana Rosa Gomez-Cano coordinator at Transmitting Science

Dr. Ana Rosa Gómez-Cano
Transmitting Science
Spain

Soledad De Esteban-Trivigno Transmitting Science coordinator

Dr. Soledad De Esteban-Trivigno
Transmitting Science
Spain

Contact: courses@transmittingscience.com

Program

Daily Program

Monday

Microscopic characteristics of Bone Surface Modifications (BSM):
- Compiling all the microscopic characteristics that identify the different types of BSM; tooth marks (by all bone-modifying biotic agents), percussion marks, stone-tool and metal marks, trampling marks, biochemical marks. Practicum: microscopic observation of referential collections.
Comparison of traditional techniques to identify and quantify BSM:
- Showing the advantages and disadvantages of all the BSM tallying methods. Practicum: microscopic observation of referential collections II.
Introduction to Machine Learning. Practicum: an introduction to R:
- Introducing students to Big Data and the various ways data are generated and handled. Describing data volume, velocity, and veracity methods. Differentiating between Data obtainment, Data Mining and Data Processing. Introduction to R: vectors, matrices, data frames and data classes.
Simple prediction. Practicum: Simple regression:
- Seeking measurable patterns in variables. Differentiating among variable types, covariance and variable correlation. How to estimate the influence of variables on each other and predict values from one dependent variable from another explanatory variable. We will start using paleobiological examples.
Complex prediction. Practicum: Multiple regression:
- Expand the predictions of estimates of one dependent variable from a set of multiple variables. Analyze covariance and interactions between variables. Combine different types of explanatory variables. We will continue with paleobiological examples. Students will analyze profit predictions of one company based on investment on several types of advertising media.

Tuesday

Wednesday

Thursday

Rattle and Random Forests:
- In this section, students will use a GUI in R to apply some of the previous analyses in a more intuitive way, and they will also learn how to make Random Forests, which are a combination of boosting and bagging applied to regression and decision trees for the selection of variables that most accurately help in making the right classification or prediction.
Introduction to H2O and CARET:
- Here, a special mono-thematic session will be devoted to two of the most advanced R libraries for Machine learning: H2O and Caret. Comparative exercises will be carried out with previous algorithms to show the power of each of them on solving the same problems.

Friday

Introduction to Deep Learning and Computer Vision: Convolutional Neural Networks:
- Provide all the theoretical tools to understand the most powerful mathematical algorithms that exist for prediction and classification with a clear focus on image detection and classification. Neural networks will be explained and some of their most advanced algorithms, like convoluted neural networks, will be used. For this last module, the sessions involved will require learning some basics of Python and for that purpose the frameworks Anaconda and Jupiter books will be used. The use of Neural networks will be carried out using both R and Python. The depth of this module, by far the most complex of the course, will depend on the learning rate of students.
Practicum:
- This last module will focus on practical applications of all the software tools learnt and with several cases for data mining. Students will have to work on a personal supervised project with data sets most adequate to their professional interests. Both taphonomic data sets generated on BSM and Bone breakage can be used.

Required textbook: James, G., Witten, D., Hastie, T., Tibshirani, R., 2013. An Introduction to Statistical Learning with applications in R. Springer. A pdf version is available for free HERE.

Fees

Course Fee
Early bird (until July 31st, 2020):
596 € *
(476.8 € for Ambassador Institutions)
Regular (after July 31st, 2020):
725 € *
(580 € for Ambassador Institutions)
This includes course material (VAT included).
* Participants from companies/industry will have an extra charge of 100 €.
Registration

You can check the list of Ambassador Institutions. If you want your institution to become a Transmitting Science Ambassador please contact us at communication@transmittingscience.com

Schedule

Course Schedule

Monday to Friday (GMT+1):
- 14:00 to 18:00 online live lessons.

The rest of the time will be taught with recorded classes and assignments, to be done between the live sessions.

Funding

Discounts are not cumulative and apply only on the Course Fee. We offer the possibility of paying in two instalments (contact courses@transmittingscience.com).

5 % Discount

Former participants will have a 5 % discount on the Course Fee.

20 % Discount

20 % discount on the Course Fee is offered for members of some organizations (Ambassador Institutions). If you want to apply to this discount please indicate it in the Registration form (proof will be asked later).

40 % Discount

Unemployed scientists living in the country were the course will be held, as well as PhD students based in that country without any grant or scholarship to develop their PhD, could benefit from a 40 % discount on the Course Fee. If you want to ask for this discount, please contact the course coordinator. That would apply for a maximum of 2 places and they will be covered by strict inscription order.

Advanced Courses in Life Sciences