Housing Price Prediction Using Deep Neural Networks: A Case Study on California Data
Abstract
Accurate estimation of housing prices is essential for informed decision-making in urban planning, real estate investment, and economic forecasting. In this study, a Deep Neural Network (DNN) model is developed to predict median housing values using the California Housing dataset. The dataset, comprising multiple socioeconomic and geographical features, is preprocessed through feature scaling and partitioned into training and testing subsets. The proposed DNN architecture consists of multiple hidden layers employing ReLU activations, optimized using the Adam algorithm with Mean Squared Error (MSE) as the loss function. The model was trained over 100 epochs, achieving a final test MSE of 0.2578 and a Mean Absolute Error (MAE) of 0.3380. Moreover, an R-squared score of 0.8032 indicates strong predictive power and generalization capability. These results suggest that deep learning models can effectively capture complex, nonlinear relationships in housing data and offer reliable tools for real-world applications.
Keywords:
Deep learning, Regression, California housing dataset, Neural network, Model evaluationReferences
- [1] Debreu, G. (1951). The coefficient of resource utilization. Econometrica: Journal of the econometric society, 273–292. https://doi.org/10.2307/1906814
- [2] Koopmans, T. C. (1951). An analysis of production as an efficient combination of activities. Activity analysis of production and allocation. https://cir.nii.ac.jp/crid/1572824499992043008
- [3] Farrell, M. J. (1957). The measurement of productive efficiency. Journal of the royal statistical society series a: Statistics in society, 120(3), 253–281. https://doi.org/10.2307/2343100
- [4] Charnes, A., Cooper, W. W., & Rhodes, E. (1978). Measuring the efficiency of decision making units. European journal of operational research, 2(6), 429–444. https://doi.org/10.1016/0377-2217(78)90138-8
- [5] Banker, R. D., Charnes, A., & Cooper, W. W. (1984). Some models for estimating technical and scale inefficiencies in data envelopment analysis. Management science, 30(9), 1078–1092. https://www.sciencedirect.com/science/article/pii/0377221778901388
- [6] Färe, R., Grosskopf, S., & Lovell, C. A. K. (1985). The measurement of efficiency of production (Vol. 6). Springer Science & Business Media. https://books.google.nl/books
- [7] Pastor, J. T., Ruiz, J. L., & Sirvent, I. (1999). An enhanced DEA Russell graph efficiency measure. European journal of operational research, 115(3), 596–607. https://doi.org/10.1016/S0377-2217(98)00098-8
- [8] Rizvi, M. (2023). Leveraging deep learning algorithms for predicting power outages and detecting faults: A review. Advances in research, 24, 80–88. https://www.researchgate.net
- [9] Ding, C., & Knaap, G. J. (2002). Property values in inner-city neighborhoods: The effects of homeownership, housing investment, and economic development. Housing policy debate, 13(4), 701–727. https://doi.org/10.1080/10511482.2002.9521462
- [10] Takase, T., Oyama, S., & Kurihara, M. (2018). Effective neural network training with adaptive learning rate based on training loss. Neural networks, 101, 68–78. https://doi.org/10.1016/j.neunet.2018.01.016