WEB PAGE ELEMENT IDENTIFICATION USING SELENIUM AND CNN: A NOVEL APPROACH
Keywords:
Web Testing, Selenium, Convolutional Neural Networks (CNNS), Element Identification, Web Automation, Web Scraping, Quality AssuranceAbstract
Web applications have become an integral aspect of our daily lives, thus rendering efficient testing and validation imperative. The process of software testing holds significant importance within the entirety of software development. In the present day, there exists a plethora of automated software testing tools that cater to the examination of diverse software applications, be it desktop-based, mobile applications, or web-based applications (Prasad et al., 2020). Within this discourse, we propose a novel approach that combines the prowess of Selenium, a renowned web automation tool, with convolutional neural networks (CNN), in order to automatically identify elements of web pages. Traditional web scraping methods often encounter difficulties when faced with dynamically generated content and intricate web structures. Our method, on the other hand, utilizes Selenium to navigate web pages, manipulate elements, as well as capture screenshots. These screenshots are subsequently processed using CNN to ascertain the identification and classification of web page elements.
To commence, we shall provide an elaborate explanation of the capabilities possessed by the Selenium framework, with a specific focus on its capacity to automate user interaction across various browsers when it comes to web applications. Selenium effectively captures real-time screenshots of web pages that are visible to users, thereby creating a valuable dataset that aids in the identification of elements. Subsequently, we shall delve deeper into the architecture of CNN and elucidate the means through which features can be extracted from these screenshots, enabling the classification of elements such as buttons, text boxes, and images. In order to validate our approach, experiments were conducted on diverse websites that possess intricate layouts and dynamic content. The outcome of these experiments serves as a testament to the efficacy of our method in accurately identifying and classifying web page elements, even in the most demanding of scenarios. Our research contributes significantly to the realms of web testing and automation by furnishing a robust solution for the identification of web page images. This approach serves to enhance the efficiency of the web testing process, minimize the need for manual intervention, and guarantee the dependability of web applications. As web technology continues to progress, our method serves as a promising avenue for automated website analysis and quality assurance.
References
Adha, A., Pamuncak, A., Qiao, W., & Laory, I. (2022). Automated building classification framework using convolutional neural network. Cogent engineering. 10.1080/23311916.2022.2065900
Fatima, S., Nasim, S. F., Haider, N. G., & Rasheed, M. (2023). Comparative Study of Software Automation Tools Selenium and Quick Test Professional. JISR on Computing. 10.31645/jisrc.23.21.1.6
Hossain, M. A., & Alam, M. S. (2019). Classification of Image using Convolutional Neural Network (CNN). Global journal of computer science and technology. 10.34257/GJCSTDVOL19IS2PG13
Lathwal, A. (2019). A Literature Review on Automation Testing Using Selenium+Sikuli. 10.4018/IJDAI.2019070104
Lucca, G., & Fasolino, A. R. (2005). Web Application Testing. 10.1007/3-540-28218-1_7
Mestry, S., Singh, H., Chauhan, R., Bisht, V., & Tiwari, K. (n.d.). Automation in Social Networking Comments with the Help of Robust fastText and CNN. 10.1109/ICIICT1.2019.8741503
Pelivani, E., & Cico, B. (2021). A comparative study of automation testing tools for web applications. 10.1109/MECO52532.2021.9460242
Prasad, L., Yadav, R., & Vore, N. (2020). A Systematic Literature Review of Automated Software Testing Tool. 10.1007/978-981-15-9712-1_10
Sharma, A., & Phonsa, G. (n.d.). Image Classification Using CNN. Social Science Research Network. DOI: 10.2139/SSRN.3833453
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Rohit Khankhoje (Author)
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.