Test of Data Query in Thai Text between Text Processing according to Semantic Text Similarity Measurement with Genetic Algorithm and Access to Data using Structured Query Language

Main Article Content

อิทธิศักดิ์ ศรีดำ

Abstract

           Data query system by keyword has been implemented in different fields such as query of frequently asked questions or answers by customer, query of information related to products or services, etc. Data query in various systems mostly use structured query language to access data. However, for data query in Thai text by structured query language, several issues are encountered including inappropriate information delivery, inaccurate information and deviated from user requirement. This article aims to present the test of data query in Thai text between text processing according to semantic text similarity measurement with genetic algorithm and access to data via structured query language as guidance of data query in Thai text using text processing according to semantic text similarity measurement with genetic algorithm that program developer is able to use as reference in data query development for higher efficiency.


          This study is conducted based on mixed methods research comprising of qualitative research and quantitative research by applying document research methods and experimental research by developing test program for data query in Thai text using text processing according to semantic text similarity measurement with genetic algorithm as well as test program for data access by structured query language. The two programs were tested in order to compare the performance using 2 testing criteria including queried text discovery rate and data query processing time. Data query test was conducted for 25 rounds and 99 experimental texts were used. Also, the same search term was used in the performance test of both programs.


          The study indicated that data query in Thai text using text processing according to semantic text similarity measurement with genetic algorithm has higher efficiency than access to data via structured query language with statistic t value equal to 159.716 (p<0.01) for average text discovery rate with statistical significance of 0.01 and statistic t value equal to -16.237 (p<0.01) for data query processing time with statistical significance of 0.01.

Article Details

Section
Original Articles

References

1. Sapsomboon B. Medical biology information search. Available from: URL: https://www.si. mahidol.ac.th/simi/pdf/pubmed May 16, 2019.
2. Laothongdee D. Search engine algorithm study and web promotion analysis. (Maters Thesis, Thammasat University). 2003.
3. Google. Advanced Search. Available from: URL: https:/google.com/advanced_search May 16, 2019.
4. Boonrawd P, Srisuktaksin P. Retrieving Big Data Using NoSQL and Semantic Web Technology. The Journal of KMUTNB 2015; 5(2): 255-264.
5. Korler P. SQL Issues. Available from: URL: http://stackoverflow.com/quest/SQL/Like-not-found May 16, 2019.
6. Svetthitikun Y, Supasetsiri P, Pakdeeronachit S. The analysis of factors for e-commerce affects the purchasing behavior of consu mers’ fashion apparel. The Journal of Social Communication Innovation 2017; 5(1): 21-31.
7. Vongsirisin K. The product value affecting the customer's purchase decision on V CARE U Natural & Premium Dog Food in Bangkok. Maters Thesis, Bangkok University). 2016. (in thai).
8. Pattrawadee M. Research Design for Mixed Method Research. Journal of the Association of Researchers 2016; 2(2): 19-31.
9. Wellington J. Education research: contem porary issues and practical approaches. London: Continuum; 2000.
10. Netisopakul P. Text similarity measurement for synonyms term and antonyms term of Thai. Available from: URL: www.it.kmitl.ac.th
/index.php/ main_journal/article May 10 2019. (in thai).
11. Scott, J. Social research and documentary sources. Sage Benchmarks in Social Research Methods, Documentary Research Volume 1. Thousand Oaks, California: SAGE Publication. 2006; 3-40.
12. Rigotti, B. How do I get my website on the first page of Google. Available from: URL: https://cristersmedia.com/how-do-i-get-my-/se/
/website-on-the-first-page-of-google/-/ May 16, 2019.
13. Mathworks. Find global minima for highly nonlinear problems. Available from: URL: https://mathworks.com/genetic-algorithm.html May 16, 2019.
14. Salah FAAS, Mohamad SA, El-Horbaty ESM. Swarm Intelligent Algorithms for solving load balancing in cloud computing. Egyptian Computer Science Journal 2019; 43(1): 45-57.
15. Whitley D. Next generation genetic algorithms: a user’s guide and tutorial. In: Gendreau M, Potvin JY. (eds) Handbook of metaheuristics. International Series in Operations Research & Management Science 2019; 272: Springer, Cham.
16. Netisopakul P. text similarity measurement for synonyms term and antonyms term of Thai. Available from: URL: http://www.it.kmitl.ac.th/ ~journal/index.php/ main_journal/article/
download/100/49 May 16, 2019. (in thai).
17. Rigotti B. How do I get my website on the first page of Google. Available from: URL: https://cristersmedia.com/how-do-i-get-myweb/ siteonthefirstpageofgoogle/ May 16 2019.
18. Google. Data search. Available from: URL: https://www.google.co,/search May 16 2019.
19. Eiben K, Agoston E, Smith JE. Introduction to Evolutionary Computing, Springer. Goldberg, David E. (1989), Genetic Algorithm in Search Optimization and Machine Learning, New York: Addison Wesley. 2003.
20. Sagum RA, Ramos AD, Llanes MT. FICOBU: Filipino WordNet construction using decision tree and language modeling. International Journal of Machine Learning and Computing 2019; 9(1): 103-107.
21. Mishra A, Sainul IA, Bhuyan, S, Deb S, Sen D, Deb AK. Development of a fexible assembly system using industrial robot with machine vision guidance and dexterous multi-finger gripper. Precision Product-Process Design and Optimization 2018; 31-71.
22. Yifei T, Meng Z, Jingwei L, Dongbo L, Yulin W. Research on intelligent welding robot path optimization based on GA and PSO algorithms. IEEE Access 2019; 6: 65397-65404.
23. Thoongsup SR. Thai WordNet construction. Proceedings of the 7th workshop on Asian language resources Association for computaional linguistics 2009: 139-144.
24. Yang D, Powers DM. Measuring semantic similarity in the taxonomy of WordNet. Australian Computer Society 2005; 38(1): 315-322.
25. Cai Y, Zhang Q, Che WA. hybrid approach for measuring semantic similarity based on IC-weighted path distance in WordNet. Journal of Intelligent Information Systems 2018; 51(1): 33-47.
26. Fazlia W. Zagerb AAAK, Rozaidaa G. Using improved firefly algorithm based on genetic algorithm crossover operator for solving optimization problems. Journal of Intelligent & Fuzzy Systems 2019; 36(2): 1547-1562.
27. Haoweia Z, Junweia X, Jiaanga G, Zhaojianb Z, Binfengc ZA. hybrid adaptively genetic algorithm for task scheduling problem in the phased array radar European Journal of Operational Research 2019; 272 (3): 868-878.
28. Krishna GJ, Ravi V. Feature subset selection using adaptive differential evolution: an application to banking. Proceedings of the ACM India Joint International Conference on Data Science and Management of Data 2019: 157-163.
29. Juneja SS, Saraswat P, Singh K. Sharma J, Majumdar R, Chowdhary S. Travelling salesman problem optimization using genetic algorithm. Amity International Conference on Artificial Intelligence (AICAI) 2019.