WEB PAGE ELEMENT IDENTIFICATION USING SELENIUM AND CNN: A NOVEL APPROACH

Authors

  • Rohit Khankhoje Avon, Indiana, USA. Author

Keywords:

Web Testing, Selenium, Convolutional Neural Networks (CNNS), Element Identification, Web Automation, Web Scraping, Quality Assurance

Abstract

Web applications have become an integral aspect of our daily lives, thus rendering efficient testing and validation imperative. The process of software testing holds significant importance within the entirety of software development. In the present day, there exists a plethora of automated software testing tools that cater to the examination of diverse software applications, be it desktop-based, mobile applications, or web-based applications (Prasad et al., 2020). Within this discourse, we propose a novel approach that combines the prowess of Selenium, a renowned web automation tool, with convolutional neural networks (CNN), in order to automatically identify elements of web pages. Traditional web scraping methods often encounter difficulties when faced with dynamically generated content and intricate web structures. Our method, on the other hand, utilizes Selenium to navigate web pages, manipulate elements, as well as capture screenshots. These screenshots are subsequently processed using CNN to ascertain the identification and classification of web page elements.

To commence, we shall provide an elaborate explanation of the capabilities possessed by the Selenium framework, with a specific focus on its capacity to automate user interaction across various browsers when it comes to web applications. Selenium effectively captures real-time screenshots of web pages that are visible to users, thereby creating a valuable dataset that aids in the identification of elements. Subsequently, we shall delve deeper into the architecture of CNN and elucidate the means through which features can be extracted from these screenshots, enabling the classification of elements such as buttons, text boxes, and images. In order to validate our approach, experiments were conducted on diverse websites that possess intricate layouts and dynamic content. The outcome of these experiments serves as a testament to the efficacy of our method in accurately identifying and classifying web page elements, even in the most demanding of scenarios. Our research contributes significantly to the realms of web testing and automation by furnishing a robust solution for the identification of web page images. This approach serves to enhance the efficiency of the web testing process, minimize the need for manual intervention, and guarantee the dependability of web applications. As web technology continues to progress, our method serves as a promising avenue for automated website analysis and quality assurance.

References

Adha, A., Pamuncak, A., Qiao, W., & Laory, I. (2022). Automated building classification framework using convolutional neural network. Cogent engineering. 10.1080/23311916.2022.2065900

Fatima, S., Nasim, S. F., Haider, N. G., & Rasheed, M. (2023). Comparative Study of Software Automation Tools Selenium and Quick Test Professional. JISR on Computing. 10.31645/jisrc.23.21.1.6

Hossain, M. A., & Alam, M. S. (2019). Classification of Image using Convolutional Neural Network (CNN). Global journal of computer science and technology. 10.34257/GJCSTDVOL19IS2PG13

Lathwal, A. (2019). A Literature Review on Automation Testing Using Selenium+Sikuli. 10.4018/IJDAI.2019070104

Lucca, G., & Fasolino, A. R. (2005). Web Application Testing. 10.1007/3-540-28218-1_7

Mestry, S., Singh, H., Chauhan, R., Bisht, V., & Tiwari, K. (n.d.). Automation in Social Networking Comments with the Help of Robust fastText and CNN. 10.1109/ICIICT1.2019.8741503

Pelivani, E., & Cico, B. (2021). A comparative study of automation testing tools for web applications. 10.1109/MECO52532.2021.9460242

Prasad, L., Yadav, R., & Vore, N. (2020). A Systematic Literature Review of Automated Software Testing Tool. 10.1007/978-981-15-9712-1_10

Sharma, A., & Phonsa, G. (n.d.). Image Classification Using CNN. Social Science Research Network. DOI: 10.2139/SSRN.3833453

Downloads

Published

2023-10-02

How to Cite

WEB PAGE ELEMENT IDENTIFICATION USING SELENIUM AND CNN: A NOVEL APPROACH. (2023). JOURNAL OF SOFTWARE QUALITY ASSURANCE (JSQA), 1(1), 1-17. https://mylib.in/index.php/JSQA/article/view/JSQA_01_01_001