Factor analysis and prediction of startups and ways to exit based on decision tree classification models with adaptive k with SMOTE method for imbalance problem

Main Article Content

Wararat Songpan
Ploypailin Kijkasiwat

Abstract

This paper focuses on factor analysis to combine the information of startups with an synthetic minority over-sampling technique (SMOTE) method via an aspect of the decision tree algorithms that assist investors in project screening for describing important features.However, the investment of a startup company has characteristics of imbalanced data. Improvements in the handling of imbalanced data based on the SMOTE method has been developed by sampling from the minority class. The problem is how to set optimized k-nearest neighbors among the most common feature values. This work purposed a method to fit data in the startup’s information that is designed to handle the data value by adaptive k with SMOTE, which manages the problem with an imbalance class label for robustness of evaluation metrics for balancing the portion of multi-class. The adaptive k experimental results can solve the k parameter setting and produce a high accuracy rate of startup companies’ class as closed, operating, and acquired status of investment at 0.84, 0.87 and 0.97 respectively. The overall accuracy rate is 0.99; that is the best outcome compared with other methods for handling imbalance. In addition, the results and discussion shown that can meet the needs of investment startup are designed and discussed of business views and machine learning views to work co-operation.

Downloads

Download data is not yet available.

Article Details

How to Cite
Songpan, W., & Kijkasiwat, P. (2023). Factor analysis and prediction of startups and ways to exit based on decision tree classification models with adaptive k with SMOTE method for imbalance problem. Science, Engineering and Health Studies, 17, 23040007. Retrieved from https://li01.tci-thaijo.org/index.php/sehs/article/view/258314
Section
Engineering

References

Ahlers, G. K. C., Cumming, D., Günther, C., and Schweizer, D. (2015). Signaling in equity crowdfunding. Entrepreneurship Theory and Practice, 39(4), 955–980.

Ahluwalia, S., and Kassicieh, S. (2022). Effect of financial clusters on startup mergers and acquisitions. International Journal of Financial Studies, 10(1), 1–13.

Andy, M. (2020). StartUp investments (Crunchbase). [Online URL: https://www.kaggle.com/datasets/arindam235/startup-investments-crunchbase] accessed on April 14, 2022.

Arroyo, J., Corea, F., Jimenez-Diaz, G., and Recio-Garcia, J. A. (2019). Assessment of machine learning performance for decision support in venture capital investments. IEEE Access, 7, 124233–124243.

Batista, G., Prati, R. C., and Monard. M. C. (2004). A study of the behavior of several methods for balancing machine learning training data. ACM Sigkdd Explorations Newsletter, 6(1), 20–29.

Bendickson, J. S., Muldoon, J., Liguori, E. W., and Midgett, C. (2017). High performance work systems: A necessity for startups. Journal of Small Business Strategy, 27(2), 1–12.

Bernstein, S., Korteweg, A., and Laws, K. (2017). Attracting early-stage investors: Evidence from a randomized field experiment. The Journal of Finance, 72(2), 509–538.

Bowen, D. E., Frésard, L., and Hoberg, G. (2019). Technological disruptiveness and the evolution of IPOs and sell-outs. In Swiss Finance Institute Research Paper Series No. 19–22. Zürich, Switzerland: Swiss Finance Institute.

Bozkaya, A., and van Pottelsberghe de la Potterie, B. (2008). Who funds technology-based small firms? Evidence from Belgium. Economics of Innovation and New Technology, 17(1–2), 97–122.

Chawla, N. V., Bowyer, K. W., Hall, L. O., and Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16(1), 321–357.

Coleman, S., Cotei, C., and Farhat, J. (2016). The debt-equity financing decisions of U.S. startup firms. Journal of Economics and Finance, 40(1), 105–126.

Cotei, C., and Farhat, J. (2018). The M&A exit outcomes of new, young firms. Small Business Economics, 50(3), 545–567.

Cumming, D., Siegel, D. S., and Wright, M. (2007). Private equity, leveraged buyouts and governance. Journal of Corporate Finance, 13(4), 439–460.

Davila, A., Foster, G., and Gupta, M. (2003). Venture capital financing and the growth of startup firms. Journal of Business Venturing, 18(6), 689–708.

Dibrova, A. (2015). Business angel investments: Risks and opportunities. Procedia - Social and Behavioral Sciences, 207, 280–289.

Dutta, S., and Folta, T. B. (2016). A comparison of the effect of angels and venture capitalists on innovation and value creation. Journal of Business Venturing, 31(1), 39–54.

Eghbali, N., and Montazer, G. A. (2017). Improving multiclass classification using neighborhood search in error correcting output codes. Pattern Recognition Letters, 100, 74–82.

Ewens, M., and Townsend, R. R. (2020). Are early stage investors biased against women? Journal of Financial Economics, 135(3), 653–677.

Farre-Mensa, J., Hegde, D., and Ljungqvist, A. (2020). What is a patent worth? Evidence from the U.S. patent “lottery.” The Journal of Finance, 75(2), 639–682.

Fisher, G., Kotha, S., and Lahiri, A. (2016). Changing with the Times: An integrated view of identity, legitimacy, and new venture life cycles. Academy of Management Review, 41(3), 383–409.

Flannery, M. J. (1994). Debt maturity and the deadweight cost of leverage: Optimally financing banking firms. The American Economic Review, 84(1), 320–331.

Gerasymenko, V., and Arthurs, J. D. (2014). New insights into venture capitalists’ activity: IPO and time-to-exit forecast as antecedents of their post-investment involvement. Journal of Business Venturing, 29(3), 405–420.

Gong, H., Sun, Y., Shu, X., and Huang, B. (2018). Use of random forests regression for predicting IRI of asphalt pavements. Construction and Building Materials, 189, 890–897.

Guo, B., Lou, Y., and Pérez-Castrillo, D. (2015). Investment, duration, and exit strategies for corporate and independent venture capital-backed start-ups. Journal of Economics & Management Strategy, 24(2), 415–455.

Haibo, H., Bai, Y., Garcia, E. A., and Li, S. (2008). ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In Proceeding of the IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), pp. 1322-1328. Hong Kong.

Hong, H., Liu, J., Bui, D. T., Pradhan, B., Acharya, T. D., Pham, B. T., Zhu, A.-X., Chen, W., and Ahmad, B. B. (2018). Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China). Catena, 163, 399–413.

Hyytinen, A., Pajarinen, M., and Rouvinen, P. (2015). Does innovativeness reduce startup survival rates? Journal of Business Venturing, 30(4), 564–581.

Kolev, J., and Schwartz, E. (2017). To price or not to price? Evaluating convertible-note startup financing. Academy of Management Proceedings, 2017(1), 10908.

Krishna, A., Agrawal, A., and Choudhary, A. (2016). Predicting the outcome of startups: Less failure, more. In Proceeding of the IEEE 16th International Conference on Data Mining Workshops (ICDMW), pp. 798–805. Barcelona, Spain.

Kwon, O., Lim, S., and Lee, D. H. (2018). Acquiring startups in the energy sector: A study of firm value and environmental policy. Business Strategy and the Environment, 27(8), 1376–1384.

Lee, S. M., and Lee, B. (2015). Entrepreneur characteristics and the success of venture exit: An analysis of single-founder start-ups in the U.S. International Entrepreneurship and Management Journal, 11(4), 891–905.

Lee, Y. W. (2019). Synergistic co-operations in the cosmetic industry: Learning and convergence between firms and social media. Kritika Kultura, 32, 237–259.

Levine, R., Lin, C., and Shen, B. (2020). Cross-border acquisitions: Do labor regulations affect acquirer returns? Journal of International Business Studies, 51(2), 194–217.

Li, J. (2020). Prediction of the success of startup companies based on support vector machine and random forest. In Proceeding of the 2nd International Workshop on Artificial Intelligence and Education, pp. 5–11. Montreal, QC, Canada.

Malliaris, A. G., and Malliaris, M. (2015). What drives gold returns? A decision tree analysis. Finance Research Letters, 13, 45–53.

Matricano, D. (2020). The effect of R&D investments, highly skilled employees, and patents on the performance of Italian innovative startups. Technology Analysis & Strategic Management, 32(10), 1195–1208.

Megginson, W. L., Meles, A., Sampagnaro, G., and Verdoliva, V. (2019). Financial distress risk in initial public offerings: How much do venture capitalists matter? Journal of Corporate Finance, 59, 10–30.

Miyamoto, H., Mejia, C., and Kajikawa, Y. (2022). A Study of private equity rounds of entrepreneurial finance in EU: Are buyout funds uninvited guests for startup ecosystems? Journal of Risk and Financial Management, 15(6), 236.

Mustafa, M. (2021). Valuation of an early stage business. In Springer Books, pp. 137–164. New York City: Springer.

Pisoni, A., and Onetti, A. (2018). When startups exit: Comparing strategies in Europe and the USA. Journal of Business Strategy, 39(3), 26–33.

Rahaman, M. M. (2014). Do managerial behaviors trigger firm exit? The case of hyperactive bidders. The Quarterly Review of Economics and Finance, 54(1), 92–110.

Rao, S. V. R., and Kumar, L. (2016). Role of angel investor in Indian startup ecosystem. FIIB Business Review, 5(1), 3–14.

Riepe, J., and Uhl, K. (2020). Startups’ demand for non-financial resources: Descriptive evidence from an international corporate venture capitalist. Finance Research Letters, 36, 101321.

Rompho, N. (2018). Operational performance measures for startups. Measuring Business Excellence, 22(1), 31-41.

Ross, G., Das, S., Sciro, D., and Raza, H. (2021). CapitalVX: A machine learning model for startup selection and exit prediction. The Journal of Finance and Data Science, 7, 94–114.

Rungi, M., Saks, E., and Tuisk, K. (2016). Financial and strategic impact of VCs on start-up development: Silicon Valley decacorns vs. Northern-European experience. In Proceeding of the 2016 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), pp. 452–456. Bali, Indonesia.

Salamzadeh, A., and Kawamorita Kesim, H. (2017). The enterprising communities and startup ecosystem in Iran. Journal of Enterprising Communities: People and Places in the Global Economy, 11(4), 456–479.

Sathaworawong, P., Saengchote, K., and Thawesaengskulthai, N. (2019). Success factor of start-up fund raising in ASEAN. Asian Administration & Management Review, 2(2), 1–26.

Schwienbacher, A. (2019). Equity crowdfunding: Anything to celebrate? Venture Capital, 21(1), 65–74.

Song, Y., Dana, L. P., and Berger, R. (2021). The entrepreneurial process and online social networks: Forecasting survival rate. Small Business Economics, 56(3), 1171–1190.

Tang, L., Tian, Y., and Pardalos, P. M. (2019). A novel perspective on multiclass classification: Regular simplex support vector machine. Information Sciences, 480, 324–338.

Thanapongporn, A., Ratananopdonsakul, R., and Chanpord, W. (2021). Key success factors and framework of fundraising for early-stage startups in Thailand. Academy of Strategic Management Journal, 20(2S), 1–16.

Thirupathi, A. N., Alhanai, T., and Ghassemi, M. M. (2021). A machine learning approach to detect early signs of startup success. In Proceedings of the Second ACM International Conference on AI in Finance, pp. 1–8. New York, USA.

Venugopal, B., and Yerramilli, V. (2022). Seed-stage success and growth of angel co-investment networks. The Review of Corporate Finance Studies, 11(1), 169–210.

Wang, Z., Wu, C., Zheng, K., Niu, X., and Wang, X. (2019). SMOTETomek-based resampling for personality recognition. IEEE Access, 7, 129678–129689.

Zhang, J. (2011). The advantage of experienced start-up founders in venture capital acquisition: evidence from serial entrepreneurs. Small Business Economics, 36(2), 187–208.