Web Mining for Enhanced Academic Visibility and Engagement Analysis Based on Visitor Data
Abstract
With online platforms that transform scholarly communication, academic journals must strategically amplify their digital footprint. This study demonstrates the value of using web analytics and time series modeling to uncover nuanced online readership trends and rhythms. Using the case of the Review of Rekayasa Sistem dan Teknologi Informasi/System Engineering and Information Technology (RESTI) Journal website's 2023 visitor data, we employ visual and ARIMA time series analysis to delineate engagement patterns aligned with academic cycles. The results reveal pronounced seasonal fluctuations, with the participation peaking in October and November, coinciding with increased research dissemination. Fitting an ARIMA model to daily new visitor data indicates positive autocorrelations, suggesting that the engagement effects persist on days. The model provides a predictive baseline for evaluating outreach initiatives. The study offers strategic information on aligning content planning with reading engagement rhythms. At the methodological level, the integration of data mining, predictive modeling, and information retrieval techniques establishes a versatile framework for investigating evolving scholarly communication dynamics in the digital age. The study also emphasizes meticulous data preparation and model diagnostics. The analytical approach presented provides actionable intelligence on trends in the use of academic portals online. This has far-reaching implications for journals seeking to strategically enhance their digital presence amidst increasing competition. With the proliferation of electronic resources, these techniques will only grow in importance for assessing and amplifying the impact of online scholarly platforms.
Downloads
References
M. Warschauer and T. Matuchniak, “New Technology and Digital Worlds: Analyzing Evidence of Equity in Access, Use, and Outcomes,” Review of Research in Education, vol. 34, no. 1, pp. 179–225, Mar. 2010, doi: 10.3102/0091732X09349791.
E. Darics and M. Cristina Gatti, “Talking a team into being in online workplace collaborations: The discourse of virtual work,” Discourse Stud, vol. 21, no. 3, pp. 237–257, Jun. 2019, doi: 10.1177/1461445619829240.
D. SHOTTON, “Semantic publishing: the coming revolution in scientific journal publishing,” Learned Publishing, vol. 22, no. 2, pp. 85–94, Apr. 2009, doi: 10.1087/2009202.
L. van Wel and L. Royakkers, “Ethical issues in web data mining,” Ethics Inf Technol, vol. 6, no. 2, pp. 129–140, 2004, doi: 10.1023/B:ETIN.0000047476.05912.3d.
R. M. Suresh and R. Padmajavalli, “An Overview of Data Preprocessing in Data and Web Usage Mining,” in 2006 1st International Conference on Digital Information Management, IEEE, Apr. 2007, pp. 193–198. doi: 10.1109/ICDIM.2007.369352.
G. C. TlAO, “An Introduction to Multiple Time Series Analysis,” Med Care, vol. 31, no. supplement, pp. YS71–YS74, May 1993, doi: 10.1097/00005650-199305001-00011.
I. G. Cohen, R. Amarasingham, A. Shah, B. Xie, and B. Lo, “The Legal And Ethical Concerns That Arise From Using Complex Predictive Analytics In Health Care,” Health Aff, vol. 33, no. 7, pp. 1139–1147, Jul. 2014, doi: 10.1377/hlthaff.2014.0048.
L. van Wel and L. Royakkers, “Ethical issues in web data mining,” Ethics Inf Technol, vol. 6, no. 2, pp. 129–140, 2004, doi: 10.1023/B:ETIN.0000047476.05912.3d.
X. Gao, R. Tan, and G. Li, “Research on Text Mining of Material Science Based on Natural Language Processing,” IOP Conf Ser Mater Sci Eng, vol. 768, p. 072094, Mar. 2020, doi: 10.1088/1757-899X/768/7/072094.
T. H. Syah, “The Role of Genetics in Domestic Research on Forestry Issues: A Text Mining Analysis,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 6, no. 6, pp. 1006–1013, 2022, doi: 10.29207/resti.v6i6.4228.
S. Kurniawan, W. Gata, D. A. Puspitawati, I. K. S. Parthama, H. Setiawan, and S. Hartini, “Text Mining Pre-Processing Using Gata Framework and RapidMiner for Indonesian Sentiment Analysis,” IOP Conf Ser Mater Sci Eng, vol. 835, p. 012057, May 2020, doi: 10.1088/1757-899X/835/1/012057.
X. Wang, A. Abraham, and K. A. Smith, “Intelligent web traffic mining and analysis,” Journal of Network and Computer Applications, vol. 28, no. 2, pp. 147–165, Apr. 2005, doi: 10.1016/j.jnca.2004.01.006.
Ronal Watrianthos and Y. Yuhefizar, “Exploring Research Trends and Impact: A Bibliometric Analysis of RESTI Journal from 2018 to 2022,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 7, no. 4, pp. 970–981, Aug. 2023, doi: 10.29207/resti.v7i4.5101.
A. Hotho, A. Nürnberger, and G. Paaß, “A Brief Survey of Text Mining,” Journal for Language Technology and Computational Linguistics, vol. 20, no. 1, pp. 19–62, Jul. 2005, doi: 10.21248/jlcl.20.2005.68.
F. Rasyid and D. A. Adytia, “Time Series Temperature Forecasting by using ConvLSTM Approach, Case Study in Jakarta,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 6, no. 4, pp. 563–569, 2022, doi: 10.29207/resti.v6i4.4162.
C. S. Alfredo and D. A. Adytia, “Time Series Forecasting of Significant Wave Height using GRU, CNN-GRU, and LSTM,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 6, no. 5, pp. 776–781, 2022, doi: 10.29207/resti.v6i5.4160.
V. P. Ariyanti and T. Yusnitasari, “Comparison of ARIMA and SARIMA for Forecasting Crude Oil Prices,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 7, no. 2, pp. 405–413, 2023, doi: 10.29207/resti.v7i2.4895.
N. K. A. I. Suryani, O. Sudana, and A. Wirdiani, “Forecasting Pneumonia Toddler Mortality Using Comparative Model ARIMA and Multilayer Perceptron,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 6, no. 4, pp. 528–537, 2022, doi: 10.29207/resti.v6i4.4106.
I. Mardianto, M. I. Gunawan, D. Sugiarto, and A. Rochman, “Comparison of Rice Price Forecasting Using the ARIMA Method on Amazon Forecast and Sagemaker,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 4, no. 3, pp. 537–543, 2020, doi: 10.29207/resti.v4i3.1902.
T. Stadnytska, S. Braun, and J. Werner, “Model Identification of Integrated ARMA Processes,” Multivariate Behav Res, vol. 43, no. 1, pp. 1–28, Mar. 2008, doi: 10.1080/00273170701836646.
H. Alshraideh and G. Runger, “Process Monitoring Using Hidden Markov Models,” Qual Reliab Eng Int, vol. 30, no. 8, pp. 1379–1387, Dec. 2014, doi: 10.1002/qre.1560.
Tarno, Subanar, D. Rosadi, and Suhartono, “New procedure for determining order of subset autoregressive integrated moving average (ARIMA) based on over-fitting concept,” in 2012 International Conference on Statistics in Science, Business and Engineering (ICSSBE), IEEE, Sep. 2012, pp. 1–5. doi: 10.1109/ICSSBE.2012.6396643.
A. S. Ahmar et al., “Modeling Data Containing Outliers using ARIMA Additive Outlier (ARIMA-AO),” J Phys Conf Ser, vol. 954, no. 1, 2018, doi: 10.1088/1742-6596/954/1/012010.
A. S. Ahmar, A. D. GS, T. Listyorini, and ..., “Implementation of the ARIMA (p, d, q) method to forecasting CPI Data using forecast package in R Software,” Journal of Physics …, 2018, doi: 10.1088/1742-6596/1028/1/012189.
Copyright (c) 2024 Journal of Systems Engineering and Information Technology (JOSEIT)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).