14_310x - Module 01 - Introduction to the Course

May 13, 2021 ยท View on GitHub

Lecture 0101 - Welcome to the Course

010101

  • degree of a node -> number of connections a node has
  • eigenvector centrality -> importance of a node related to how important are its direct connections

010103

  • Benford's law: the distribution ofthe first digit in a numeric dataset is not uniform, but geometric: the digit 1 is far more common than the digit 9. Using this it is possible to test if datas has been fabricated at art or they are real observations.

010105

  • If you're very convinced about a particular causal story, you can look at the data in the way that is going to reinforce your belief in this very strong causal story.
  • So when one looks at data, one needs to use priors and theory to know what to look at.
  • But one also needs to let the data speak.
  • And in some sense, tie your hands to look at the data and not keep the part of the data that's good for you and not the part that's less good for you.

010106

  • Education at aggregate levels is more effective than education as individual level because of "externalities of education", where educated people can share and propagate the knowledge.

  • Omitted variables could explain why two series looks going together in a trend.

010108

  • Course program

    • probabilities: how to model the processes that might have generated our data
    • statistics: how do we summarize and describe the data and and how do we try to go from to date to the processes that might have generated it
    • exploratory data anlysis, econometry, machine learning: uncover the patterns between variables to find how variables might be related to each other.
    • various regression tests: to find causality in our data
  • How to do? In practice!:

    • using R
    • learning experiment design to obtain data
    • learn about data sources
  • How to represent our data and findings ?:

    • plots
    • tables
    • texts

010109

Detailed course program

  • Topic and number of lessons
  • Introduction and motivations 1
  • Probability 8
    • Definitiona
    • Random variables
    • Distributions of RVs
    • Function of RVs
    • Expectation, variance
  • Basic estimation and inference 3
    • Definitions
    • Estimators
    • CLT
    • Confidence intervals
    • Hypothesis testing
  • Randomised controlled trials 2
  • Nonparametric estimation 1
  • Causality 1
  • Regression analysis 4
  • Design of experiment 2
  • Machine learning 2
  • Assorted topics, such as visual display 1

See also the index from the last run of the course at https://screenshots.firefox.com/8msYlIPmQuNg74EI/courses.edx.org