• Welcome to the Open Risk Academy where we forge the open future of risk management!

    Academy Visual


    Search for a topic of interest or simply scroll down the page to find an interesting course!

    Can't find an interesting course? Use the course request form to outline a topic of interest and (subject to resources and fit) we will try to accommodate!


Courses


Academy News

New Course: Visualization of Time Series Data

by Ad Min -

A new course in the data science category offers a deep-dive into the structure of visualizations, in particular visualizations of timeseries data.

The course is now live here at the Academy

 

 
Pre-requisites and target audience

Knowledge of basic visualization techniques and of the mathematical notation for functions and maps. Familiarity with data series and their usage in data science. The course should be useful for people who need to work with visualizations, e.g. in the context of exploratory data analysis, and who want to deepen their intuition about how visualizations are put together.

All visualizations are produced using open source Python or Javascript libraries but this is not a course about programming visualizations!

 

Summary of the Course

What we aim to achieve in this course is to deconstruct how both typical and less common visualizations of timeseries work.

In the first instance we decompose the visualization process into:

  • A mathematical transformation, which (optionally) may operate on the raw data and produce new representations thereof
  • A visual transformation, which converts quantitative data into a visual space
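
As a minimal sketch of this two-step decomposition (not the course's own code), the example below applies a sorting step as the mathematical transformation and a linear scaling onto a canvas as the visual transformation. The data, the function names sort_values and to_canvas, and the canvas size are all assumptions made for illustration.

```python
# A short, made-up series of (time, value) observations.
series = [(0, 3.0), (1, 1.0), (2, 4.0), (3, 1.5), (4, 5.0)]


def sort_values(data):
    """Mathematical transformation: re-index the values in ascending order."""
    values = sorted(value for _, value in data)
    return list(enumerate(values))


def to_canvas(data, width=100.0, height=50.0):
    """Visual transformation: map (x, y) pairs linearly onto a canvas."""
    xs = [x for x, _ in data]
    ys = [y for _, y in data]
    x_min, x_max = min(xs), max(xs)
    y_min, y_max = min(ys), max(ys)

    def scale(v, lo, hi, size):
        # Guard against a constant series, which has no range to scale.
        return 0.0 if hi == lo else (v - lo) / (hi - lo) * size

    # Canvas y coordinates usually grow downward, so the vertical axis is flipped.
    return [
        (scale(x, x_min, x_max, width), height - scale(y, y_min, y_max, height))
        for x, y in data
    ]


# Visual transformation only: canvas positions for a plain scatter or line plot.
raw_points = to_canvas(series)

# Mathematical transformation followed by the same visual transformation:
# canvas positions for a sorted chart.
sorted_points = to_canvas(sort_values(series))
```

The same to_canvas step serves both cases; only the optional mathematical step changes what the visualization shows.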

We apply this "recipe" to a large number of visualizations (21 in total), always using the same simple data series. The result is an exploration of the many diverse ways visualization can help extract meaning from data. The steps of the course:

21 ways to visualize a simple timeseries

 

  1. A Numerical Table is also a Visualization
  2. Visualizing Observation Times
  3. Visualizing Observation Values
  4. Color Plots of Measurement Values
  5. Bubble Plots of Measurement Values
  6. Scatter Plots and their Limitations
  7. Linear Line Plot and Continuity
  8. The Step Plot and Discreteness
  9. The Smooth Plot: Pleasant but with a stinging tail
  10. The Area Chart: Filling up space to our advantage
  11. New Visualization Horizons with the Horizon Chart
  12. Abusing the Bar Chart Concept
  13. A Sorted Bar Chart and the power of mathematical transformations
  14. The Histogram Transformation
  15. The (Probability) Density Plot and Mathematical Models
  16. The Lag Plot and Persistence
  17. Autocorrelation and further Arcana
  18. The Phase Diagram and Dynamical Systems
  19. Displaying data in the Frequency domain
  20. A Calendar is also a Visualization
  21. The Plot Thickens: The Weekly Calendar Version
Enjoy!

New courses in Data Science, Sustainable Finance and Credit Portfolio Management categories

by Ad Min -

The following three courses have been activated in the Open Risk Academy and are open to all registered users:

1. Input-Output Models as Graph Networks

Course Link

 

Economic Input-Output models find various applications in Sustainable Finance. They are typically expressed in terms of matrices and vectors, but a certain type of qualitative analysis shows a strong affinity with graph theory. In this course we go over the relevant concepts and the linkage between these two domains.

Summary of the Course
  • Step 1. In this step we discuss in more detail the motivation for the course and provide a very brief introduction to graph theory to establish the notation.
  • Step 2. In this step we explore the duality between graphs and matrix representations.
  • Step 3. This step introduces the concept of Qualitative Input-Output Analysis.
  • Step 4. In the fourth step of the course we discuss special kinds of nodes: Sources, Sinks and Conservation Laws.
  • Step 5. In the final step of the course we discuss and interpret in graph terms the typical question one wants to answer with an IO model: what happens if there is a new set of final demands?
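
The matrix-to-graph correspondence behind these steps can be sketched as follows. This is an illustration under stated assumptions, not the course's own material: the 4-sector flow matrix Z is made up, the convention that Z[i][j] is the flow from sector i to sector j is assumed, and keeping an edge wherever a flow exceeds a threshold is only one simple way to read a matrix qualitatively.

```python
# Hypothetical 4-sector flow matrix: Z[i][j] is the flow from sector i to sector j.
Z = [
    [0.0, 2.0, 1.0, 0.0],
    [0.0, 0.0, 3.0, 0.0],
    [0.0, 0.0, 0.0, 4.0],
    [0.0, 0.0, 0.0, 0.0],
]


def to_edges(matrix, threshold=0.0):
    """Qualitative view: keep a directed edge i -> j where the flow exceeds the threshold."""
    return [
        (i, j)
        for i, row in enumerate(matrix)
        for j, value in enumerate(row)
        if value > threshold
    ]


def sources_and_sinks(matrix, threshold=0.0):
    """Sources have outgoing edges only; sinks have incoming edges only."""
    edges = to_edges(matrix, threshold)
    has_out = {i for i, _ in edges}
    has_in = {j for _, j in edges}
    nodes = range(len(matrix))
    sources = [k for k in nodes if k in has_out and k not in has_in]
    sinks = [k for k in nodes if k in has_in and k not in has_out]
    return sources, sinks


edges = to_edges(Z)
sources, sinks = sources_and_sinks(Z)
```

Raising the threshold drops the weaker flows and so changes which nodes count as sources or sinks.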

2. Mathematical Representations of Credit Portfolio Data

Course Link

 

For our purposes in this course Credit Data is any well-defined dataset that has direct applications in the assessment of the Credit Risk of an individual or an organization. More generally, it is any dataset that allows the application of data-driven Credit Portfolio Management policies. Digging into the meaning of these data collections, the logic that binds them together, is essential for understanding what they can be used for and what limitations and issues they may be affected by. This course explores new angles to look at old practices.

Summary of the Course
  • Step 1. Definition of Credit Data
  • Step 2. Credit Data Classifications
  • Step 3. From Graphs to Reference Data
  • Step 4. Static Credit Data Snapshots
  • Step 5. Dynamic (Performance) Credit Data
  • Step 6. Scheduled versus Actual Cash Flows

3. An overview of graph methods in data science

Course Link

Graphs (and the related concept of Networks) have emerged from a relatively niche area of mathematics and physics to become a ubiquitous model for describing and interpreting various phenomena in very diverse domains. In fact, the term graph now appears in so many different contexts that it is hard to keep track of the meaning of, and relations between, all these applications. In this course we aim to explore the relations between different graph concepts as they are currently used in data science and related fields.

Summary of the Course
  • Step 1. Introduction
  • Step 2. The Graph of a Function
  • Step 3. The Mathematical Graph
  • Step 4. The Abstract Data Type (ADT) Graph
  • Step 5. Computation Graphs
  • Step 6. Data Graphs
  • Step 7. Property Graphs
  • Step 8. Knowledge Graphs
  • Step 9. Graph Databases
  • Step 10. Probabilistic Graph Models
  • Step 11. Graph Neural Networks (GNN)

Enjoy!

New Course: Working with Large Matrices using Command Line Tools

by Ad Min -

 

Dear Academy users,

we are happy to release a fresh new course: Working with Large Matrices using Command Line Tools

What is this course about

In this course we explore a number of Linux command line interface (CLI) tools:

  • Bash scripting
  • Several basic CLI commands (ls, cd, etc.)
  • File manipulation oriented CLI commands such as head, cut, wc
  • The awk programming language and scripting

We apply these in a very concrete context: working with large matrix files that form part of various economic input-output models. Such files are cumbersome to work with in spreadsheets, but the overhead of using a full-blown statistical / data science environment might also be high. Command line tools offer a handy intermediate approach that may be useful in various contexts.

Prerequisites

Basic knowledge of, and a working setup of, a Linux or Linux-like development environment (including working with a shell and a text editor) is essential. Any standard Linux distribution should work, as should WSL on Windows machines and macOS (possibly with the installation of GNU tools).

Some exposure to scripting and to any general purpose programming language (e.g., Python, Javascript, C++, Java) is required for understanding the scripts and working through the awk exercises.

The course derives its motivation from the task of processing large matrix data. Hence, some idea of what a matrix is, and why it is relevant to know how to work with one, is assumed. This is not required for completing the course, however, as we do not go into any mathematical aspects of matrices.

Table of Contents

  • Motivation for Command Line tools
  • Overview and Setup of CLI Tools
  • A hello world in Awk
  • Downloading Data: use command line tools to fetch published matrix data and store it on local disk
  • Extracting Data: verify that we have downloaded the correct datasets and (if necessary) bring them into a usable shape (e.g. by uncompressing them)
  • Scanning Data Files: get a first high-level view of what sort of files we have downloaded
  • Figuring out Structure and Dimensions: understand the structure of the files (separators, total number of rows and columns involved and their nature)
  • Scrubbing / Cutting / Reshaping: create clean files where matrix data with a known number of rows and columns are stored in tab-separated ASCII format
  • Transformations: perform simple mathematical transformations and statistical operations, and investigate the degree to which matrix values are non-trivial (non-zero)
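
The course itself works with awk and other command line tools. Purely as an illustrative analogue in Python, the sketch below shows what the "structure and dimensions" and "non-zero values" steps compute for a clean tab-separated matrix file. The function name matrix_stats and the tiny in-memory sample are assumptions, and the code assumes every cell is numeric with no header row.

```python
import csv
import io


def matrix_stats(handle, delimiter="\t"):
    """Stream a delimited numeric matrix and report its shape and sparsity."""
    rows = 0
    column_counts = set()
    total = 0
    nonzero = 0
    # Reading record by record avoids loading a large file into memory.
    for record in csv.reader(handle, delimiter=delimiter):
        if not record:
            continue  # skip blank lines
        rows += 1
        column_counts.add(len(record))
        for cell in record:
            total += 1
            if float(cell) != 0.0:
                nonzero += 1
    return {
        "rows": rows,
        # More than one entry here means the rows are ragged.
        "column_counts": sorted(column_counts),
        "nonzero_fraction": nonzero / total if total else 0.0,
    }


# A tiny in-memory stand-in for a real matrix file (2 rows, 3 columns).
sample = io.StringIO("0\t1.5\t0\n2\t0\t0\n")
stats = matrix_stats(sample)
```

For a real file, the handle would come from open(path, newline=""), where path is whatever file was downloaded and cleaned in the earlier steps.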

Resources

We will work with Input-Output matrices downloaded from well-known public distributions (EXIOBASE, FIGARO, OECD-ICIO). Scripts providing guidance and solutions to the suggested exercises are available in the Open Risk Academy GitHub repositories.

Enjoy!

 

New Course: An Introduction to the Copernicus Satellite Data Ecosystem

by Ad Min -
Earth Day is an annual event on April 22 to demonstrate support for environmental protection. First held on April 22, 1970, Earth Day celebrations now include a wide range of events coordinated globally by earthday.org (formerly Earth Day Network) including 1 billion people in more than 193 countries.

We honor this occasion by releasing a new Academy data science course that provides an introduction to the Copernicus Satellite Data Ecosystem. Copernicus is the name of an Earth observation system of the European Union's Space programme. Its objective is to look at our planet and its environment, in particular serving the needs of European citizens. Copernicus offers integrated information services (data) that draw both from satellite-based Earth Observations and from In-Situ (non-space) data collection.

Picture of a sentinel satellite

This course should be a useful first reading for anybody who wants to get involved in using Copernicus data but has no prior experience.

In the first step we give a high-level overview of the Copernicus program. This should serve as a first orientation for anybody who is interested in working with Copernicus data but is not yet sure what is available.

In the second step we build a high level catalog of the available data resources. Copernicus is a large program, involving many institutions, diverse satellites and instruments, providing original and processed data, including additional sources of ancillary data and an ever evolving set of tools providing data access. This section aims to organize all these to some degree.

In the third and final step of this introduction to the Copernicus programme we discuss some tools that are relevant for working with Copernicus data resources. Detailed work with Copernicus data requires using diverse specialized tools, but there are ongoing efforts to systematize and standardize them with various toolboxes, platforms and APIs. Here we only scratch the surface of the relevant technologies to get you started!

Enjoy, and as always keep us posted with any feedback, ideas or suggestions about how to improve the Open Risk Academy.

Open Source and Open Education (OEWeek online chat event)

by Ad Min -


Open Education Week Banner

Open source software is very effective in democratizing digital capabilities across a vast range of domains. As a prime example, Moodle is an open source eLearning platform that powers hundreds of thousands of learning environments globally (it also provides the basic infrastructure for the Open Risk Academy). Yet contributing to open source faces the formidable barrier of scarce technical (digital) literacy. How can we best use open source to teach about open source? Can we set in motion a virtuous cycle in which people use open source tools to build more open source education tools? This is the topic of a free format online chat event.

The event should be interesting for anybody thinking or working at the intersection of open source and open education irrespective of which domain they specialize in. We'll try to make a number of different channels available:

The scheduled time is 5:00pm-6:00pm (Europe/Amsterdam time) on the 7th of March 2024. Come and join the discussion!

A new short course providing interactive analysis of input-output models

by Ad Min -

A new short introductory course on Input-Output analysis focuses on stylized interactive calculations that can be performed directly within the online reading sections.

The level of the course is introductory. There are no mathematical or computer science prerequisites. It should be useful for anybody wishing to obtain insights into the flow of the basic calculations behind modern large-scale Environmentally Extended Input-Output Models.

The objective of the course is to provide intuition and familiarity with the basic calculation workflow, indicatively:

From the Initial IO System

Calculate the Technical Requirements Matrix:

Calculate the Leontief Inverse Matrix:
Enter a new Demand vector:
To obtain the new IO system:
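
The workflow listed above can be sketched in a few lines of Python. This is a hedged illustration rather than the course's interactive calculation: the 2-sector numbers are made up, and the symbols Z (inter-sector flows), x (total output), y (final demand), A (technical requirements) and L (Leontief inverse) follow common textbook notation that is assumed here. The sketch uses the standard relations A[i][j] = Z[i][j] / x[j], L = (I - A)^-1 and x_new = L y_new, with exact fractions and a closed-form 2x2 inverse to keep it dependency-free.

```python
from fractions import Fraction

# Initial IO system (made-up numbers): each row of Z plus final demand y gives output x.
Z = [[10, 20], [30, 40]]
y = [70, 30]
x = [sum(Z[i]) + y[i] for i in range(2)]  # [100, 100]


def technical_coefficients(Z, x):
    """A[i][j] = Z[i][j] / x[j]: input from sector i per unit of output of sector j."""
    n = len(x)
    return [[Fraction(Z[i][j], x[j]) for j in range(n)] for i in range(n)]


def leontief_inverse_2x2(A):
    """L = (I - A)^-1, written out in closed form for the 2x2 case only."""
    a, b = 1 - A[0][0], -A[0][1]
    c, d = -A[1][0], 1 - A[1][1]
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def total_output(L, demand):
    """x = L y: total output needed to satisfy a final demand vector."""
    n = len(demand)
    return [sum(L[i][j] * demand[j] for j in range(n)) for i in range(n)]


A = technical_coefficients(Z, x)
L = leontief_inverse_2x2(A)

# Enter a new demand vector and obtain the new IO system.
y_new = [80, 30]
x_new = total_output(L, y_new)
Z_new = [[A[i][j] * x_new[j] for j in range(2)] for i in range(2)]
```

As a consistency check, total_output(L, y) reproduces the initial outputs, and each row of Z_new plus the new demand sums to the corresponding entry of x_new. For realistic matrix sizes a numerical linear algebra library would replace the 2x2 formula.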


Enjoy!
