Jingqi Chen’s Data Science Portfolio

About Me

Projects

STA 141B Project

This project focuses on house prices in California and the factors that affect them. We investigated how public school ratings and racial composition relate to home prices, and we also examined the relationship between crime occurrence and house prices. Finally, we used web scraping to gather data for predicting the trend in house prices.

Group members: Ka Woon Tam, Yili Wang, Weizhuo Xiong

STA 141A Project

This project focuses on world GDP growth and population growth. Its purposes are to observe how GDP and population have changed in countries around the world and to predict these variables for the United States.

UC Davis Statistics Research Training Group

This project focuses on high-dimensional data. In this section, we discuss some aspects of the Monte Carlo method our team used to simulate high-dimensional data. Monte Carlo methods are a class of computational algorithms that rely on repeated random sampling to obtain numerical results, and they can be used to solve problems that have a probabilistic interpretation. Although using real high-dimensional data is also an option, we usually do not know the true distribution of the data points, so what we observe in real data might not align with theory. With simulated data, however, we fix the sample size, dimension, and distribution beforehand, so we can always test whether our expectations are correct.
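To make the idea concrete, here is a minimal sketch of such a check in Python. It is not the team's actual code. The distribution (standard normal), the quantity tested (the squared Euclidean norm, whose expected value equals the dimension), and the function names `simulate_squared_norms` and `monte_carlo_mean_squared_norm` are all assumptions chosen for illustration.

```python
import random
import statistics


def simulate_squared_norms(n_samples, dim, seed=0):
    """Draw n_samples points from a dim-dimensional standard normal
    distribution and return the squared Euclidean norm of each point.

    The distribution and the statistic are illustrative assumptions.
    """
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    squared_norms = []
    for _ in range(n_samples):
        point = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        squared_norms.append(sum(x * x for x in point))
    return squared_norms


def monte_carlo_mean_squared_norm(n_samples, dim, seed=0):
    """Monte Carlo estimate of E[||X||^2]; theory says this equals dim."""
    return statistics.fmean(simulate_squared_norms(n_samples, dim, seed))


if __name__ == "__main__":
    n, d = 2000, 100  # sample size and dimension fixed beforehand
    estimate = monte_carlo_mean_squared_norm(n, d)
    print(f"dimension={d}, theoretical mean={d}, estimate={estimate:.2f}")
```

Because the sample size, dimension, and distribution are fixed in advance, the theoretical value is known exactly, and the simulated estimate can be compared against it directly. Increasing `n` should bring the estimate closer to the dimension `d`.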