Data Analysis on the Influencing Factors of the Real Estate Price
DOI:
https://doi.org/10.37256/aie.222021966Keywords:
data analysis, real estate price, decision tree, random forestAbstract
China has carried out a large number of real estate market reforms that change the real estate market demand considerably. At the same time, the real estate price has soared in some cities and has surpassed the spending power of many ordinary people. As the real estate price has received widespread attention from society, it is important to understand what factors affect the real estate price. Therefore, we propose a data analysis method for finding out the influencing factors of real estate prices. The method performs data cleaning and conversion on the used data first. To discretize the real estate price, we use the mean ± standard deviation (SD), mean ± 0.5 SD, and mean ± 2 SD of the price and divide it into three categories as the output variable. Then, we establish the decision tree and random forest model for six different situations for comparison. When the data set is divided into training data (70%) and testing data (30%), it has the highest testing accuracy. In addition, by observing the importance of each input variable, it is found that the main influencing factors of real estate price are cost, interior decoration, location, and status. The results suggest that both the real estate industry and buyers should pay attention to these factors to adjust or purchase real estate.