Econ115a: Econometrics Lab


Laboratory schedule: Wednesday | 13:00 - 16:00
Instructor: Christopher Llones
e-mail: christopher.llones@vsu.edu.ph
Pre-requisites: Stat21 or introductory statistics
Course credits: 3 units
Number of hours: 2 hrs lectures and 3 hrs laboratory per week


Course description

This laboratory course offers a practical, data-driven introduction to statistical methods used in economic research and policy analysis, with a particular focus on survey data. Students will engage with both foundational and advanced techniques for analyzing real-world datasets, emphasizing hands-on implementation, critical interpretation, and reproducible reporting.

Key topics include:

  • Fundamentals of statistical inference and data exploration
  • Survey design and measurement reliability
  • Assumption testing and model diagnostics
  • Correlation analysis and regression modeling (linear, logistic, and limited-dependent)
  • Hypothesis testing for means and group comparisons
  • Non-parametric statistical techniques
  • Multivariate methods including exploratory factor analysis

Students will use R—a powerful statistical programming language—to clean and visualize data, conduct analyses, and generate publication-ready outputs. By the end of the course, students will be equipped to design surveys, analyze empirical data, and communicate findings with clarity, rigor, and policy relevance.

Course objectives

By the end of the course, students will be able to:

  • Design and implement valid survey instruments
  • Conduct exploratory data analysis (EDA) on survey datasets
  • Evaluate statistical assumptions and model fit
  • Apply correlation and regression techniques, including models for limited-dependent variables
  • Perform parametric and non-parametric hypothesis tests
  • Implement multivariate techniques such as factor analysis and clustering
  • Use R for data wrangling, visualization, and reproducible reporting
  • Interpret and communicate statistical findings in academic and policy contexts
  • Critically assess empirical research for methodological soundness and relevance

Course outline

Topics Lessons Description
Module 1: Introduction to R Programming
  1. Installing R and RStudio
  2. R Basics
  3. Working with R scripts
  4. Importing data
  5. Basic data wrangling
  1. Learn to install and configure R and RStudio.
  2. Understand and apply basic R syntax including data types, vectors, and data frames.
  3. Develop proficiency in writing, saving, and executing R scripts.
  4. Import dataset and prepare them for analysis.
  5. Clean and transform data .
Module 2: Introduction to data visualization using ggplot2
  1. Understanding grammar of graphics
  2. Dataset and mapping
  3. Geometries
  4. Statistical transformation and plotting distribution
  5. Position adjustment and scales
  1. Understand the grammar of graphics and its role in structuring visualizations.
  2. Create and customize basic plots including histograms, bar charts, boxplots, and scatterplot.
  3. Map variables to visual aesthetics such as color, shape, and size to enhance interpretability.
  4. Apply faceting techniques to produce multi-panel plots for comparative analysis.
  5. Modify plot themes and coordinate systems to improve clarity and accessibility.
  6. Export visualizations for use in reports, presentations, policy briefs, and others.
Module 3: Reproducible report with Quarto in R
  1. Introduction to Quarto
  2. Creating Quarto document
  3. Embedding R code
  4. Formatting Outputs
  5. Exporting reports
  1. Learn to create dynamic, reproducible documents using Quarto and markdown syntax.
  2. Embed R code and inline calculations within narrative text to integrate analysis and interpretation.
  3. Format outputs such as tables and plots for professional presentation.
  4. Render reports to multiple formats including HTML, PDF, and Word for diverse audiences.
  5. Develop the ability to produce transparent, replicable research outputs for academic and policy contexts.
Module 1: survey research design
  1. Methods of data collection
  2. Sampling design in surveys
  3. Measurement issues in survey research
  4. Questionnaire construction
  5. Basics of interviewing
  6. Creating a codebook
  7. Data entry
  1. Discuss the various methods of data collection including survey, observation and experimental methods.
  2. Discuss different ways for gathering a sample; random and non-random sampling. Discuss rudimentary formulas for sample size calculation.
  3. Discuss the issues in assigning numbers to represent quantities of attributes. Discuss the various scales of measurement. Discuss criteria in constructing good measurement of variables: reliablity and validity.
  4. Discuss the various advantages and disadvantages of interviews and questionnaire over other methods of data collection.
  5. Discuss the do’s and dont’s of an interviewer’s conduct.
  6. Discuss the importance of creating a codebook for survey data.
  7. Discuss rudimentary of data entry.
Module 2: exploratory data analysis
  1. Rudiments of EDA
  2. Charts and tables
  3. Measures of central tendency
  4. Dispersion, parameters, skewness and kurtosis
  5. Contingency tables and scatter plot
  1. Discuss EDA as the first step in data analysis.
  2. Discuss various techniques in summarizing and visualizing data.
  3. Discuss the various measures of central tendency and data location.
  4. Discuss the relevance of various dispersion, parameters, skewness and kurtosis.
  5. Discuss the relevance of contingency tables and scatterplots for summarizing and visualizing data.
Module 3: test on means
  1. Parametric test on means
  2. Non-parametric test on means
  1. Discuss the various t-tests and ANOVA and perform them on a sample data with R.
  2. Perform various non-parametric equivalent of the t-tests and ANOVA on sample data with R.
Module 4: correlation and regression analysis
  1. Correlation analysis
  2. Review of Regression analysis
  1. Discuss the various types of correlation analysis procedures. Interpret the correlation coefficient.
  2. Discuss the various aspects of regression model building.
Module 5: limited-dependent variable models
  1. Review of binary dependent regression
  2. Extension to the logit model.
  3. Censored and truncated regression models.
  4. Count dependent variable models.
  1. Discuss the various aspects of the logit and probit.
  2. Discuss the multinomial and ordinal logit models.
  3. Discuss the Tobit regression model for censored data and truncated regression models.
  4. Discuss Poisson and Negative Binomial regression models in the regression analysis of count-dependent variable models.
Module 6: multivariate statistical analysis
  1. Cluster analysis
  2. Principal component analysis
  3. Exploratory factor analysis
  4. Confirmatory factor anlysis
  5. Structural equation modelling
  1. Understand and apply different clustering methods. Analyze and evaluate the quality and effectiveness of different clusters in dataset.
  2. Learn to perform and interpret principal component analysis to reduce the dimensionality of dataset. Develop the ability to identify and retain significant components for simplifying data without losing critical information.
  3. Identify and estimate underlying factor structures within a set of observed variables.
  4. Understand model-based factor anlaysis and develop proficiency in evaluating model fit and making necessary adjustments to improve analysis.
  5. Apply SEM techniques to understand relationships among variables and construct theoretical models.