Impact Factor 2023: 2
The purpose of the Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology is to foster advancements of knowledge and help disseminate results concerning recent applications and case studies in the areas of fuzzy logic, intelligent systems, and web-based applications among working professionals and professionals in education and research, covering a broad cross-section of technical disciplines.
The journal will publish original articles on current and potential applications, case studies, and education in intelligent systems, fuzzy systems, and web-based systems for engineering and other technical fields in science and technology. The journal focuses on the disciplines of computer science, electrical engineering, manufacturing engineering, industrial engineering, chemical engineering, mechanical engineering, civil engineering, engineering management, bioengineering, and biomedical engineering. The scope of the journal also includes developing technologies in mathematics, operations research, technology management, the hard and soft sciences, and technical, social and environmental issues.
Authors: Yue, Lizhu | Lv, Yue
Article Type: Research Article
Abstract: The VIseKriterijumska Optimizacija I Kompromisno Resenje (VIKOR) method to some extent modifies the utility function to a value function that can consider different risk preferences. However, the weight and risk attitude parameters involved in the model are difficult to determine, which limits its application. To overcome this problem, a Poset-VIKOR model is proposed. The partially ordered set (poset) approach is a non-parametric decision-making method. By combining the poset approach with the VIKOR model, the parameters can be “eliminated”, and a robust method for running the model is obtained. This method uses the Hasse diagram to express the evaluation results, which can not only directly display the hierarchical and clustering information, but also show the robustness characteristics of the alternative comparison.
Keywords: VIKOR method, poset, weight, multiple attribute decision making
DOI: 10.3233/JIFS-230680
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-17, 2024
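For readers unfamiliar with the parameters the abstract above refers to, the following is a minimal sketch of the classical VIKOR scores, not the authors' Poset-VIKOR model. It assumes every criterion is benefit-type (higher is better); the function name `vikor`, the sample matrix, and the default `v=0.5` are illustrative assumptions. The criterion weights and the trade-off parameter `v` are exactly the inputs that the abstract describes as hard to determine.

```python
def vikor(matrix, weights, v=0.5):
    """Classical VIKOR scores for a benefit-type decision matrix.

    matrix[i][j] is the score of alternative i on criterion j (higher is better).
    Returns (S, R, Q) as lists; a lower Q indicates a better compromise.
    """
    n_crit = len(weights)
    best = [max(row[j] for row in matrix) for j in range(n_crit)]
    worst = [min(row[j] for row in matrix) for j in range(n_crit)]

    S, R = [], []
    for row in matrix:
        # Weighted, normalized distance of this alternative from the best value
        # on each criterion (0.0 when a criterion does not discriminate).
        d = [
            weights[j] * (best[j] - row[j]) / (best[j] - worst[j])
            if best[j] != worst[j] else 0.0
            for j in range(n_crit)
        ]
        S.append(sum(d))  # group utility
        R.append(max(d))  # individual regret

    def scale(x, lo, hi):
        # Min-max scaling, with 0.0 when all values coincide.
        return 0.0 if hi == lo else (x - lo) / (hi - lo)

    Q = [
        v * scale(s, min(S), max(S)) + (1 - v) * scale(r, min(R), max(R))
        for s, r in zip(S, R)
    ]
    return S, R, Q


# Hypothetical data: three alternatives scored on two equally weighted criteria.
S, R, Q = vikor([[7, 9], [9, 7], [5, 5]], [0.5, 0.5])
```

Changing `weights` or `v` can change the ranking by `Q`, which is the sensitivity a parameter-free approach aims to avoid.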
Authors: Shao, Dangguo | Huang, Chunsheng | Liu, Cuiyin | Ma, Lei | Yi, Sanli
Article Type: Research Article
Abstract: The automatic segmentation of diabetic retinopathy (DR) holds significant importance for assisting physicians in diagnosis and treatment. Given the complexity, high inter-class similarity, and uncertainty of DR, it is crucial to integrate multiscale information between lesions and establish global correlations among them. To address these issues, a novel HRU-TNet (Hybrid Residual U-Transformer Network) algorithm for retinal lesion segmentation is proposed. In this framework, the network is augmented with lightweight self-attention residual U-modules (LSA-RSU) to capture high-frequency details of the lesions and global contextual information. The skip connections are then enhanced through interactive residual transformer fusion modules (IRTF) and channel-cross attention (CCA), promoting dependencies among features at different scales and filtering out interfering information to guide feature fusion and eliminate ambiguity. Additionally, a novel retinal image enhancement technique is devised, employing local wavelet transformations to capture detailed components of the retinal images, thereby enhancing the representational capacity of the segmentation network. Data augmentation is also performed to ensure network adaptability to small datasets. Comprehensive experiments conducted on the publicly available IDRID and e_ophtha datasets yielded average AUC_PR values of 0.709 and 0.451, respectively. The proposed approach demonstrated superior generalization on the DDR dataset compared to other methods mentioned in the literature. These results demonstrate that our proposed method is better suited for small retinal datasets, exhibiting improved segmentation accuracy and generalization compared to existing approaches.
Keywords: Lesion segmentation, fundus image enhancement, transformer, cross attention fusion, light self-attention residual
DOI: 10.3233/JIFS-240788
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: He, Xiaorong | Fang, Anran | Yu, Dejian
Article Type: Research Article
Abstract: Electronic commerce (EC) has become the most critical business activity in the world. China has become the world’s largest market for EC. Over the past three decades, numerous studies have examined the current status of the development of monolingual EC research in specific scenarios. However, the paradigm shift in EC development through the analysis of the dynamic evolution of semantic information has not yet been examined, and the distinctions and connections between multilingual EC studies have not yet been established. This study analyzed 16,207 English and 17,850 Chinese EC-related articles from the Web of Science database and CNKI by combining the BERTopic topic model and SBERT sentence embedding-based similarity computations. The results reveal the distributions of global and local topics in the English and Chinese EC literature, and capture the semantic intricacies of topic convergence and evolution across continuous time, as well as the distinctions and connections between English and Chinese topics. Finally, the evolutionary patterns and life cycles of three crucial English and Chinese topics are explored, including their emergence, development, maturity, and decline. Overall, this study provides a comprehensive overview of EC studies from a topic perspective.
Keywords: Electronic commerce, BERTopic, topic modeling, topic evolution, sentence embedding
DOI: 10.3233/JIFS-232825
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-22, 2024
Authors: Kazancı, O. | Hoskova-Mayerova, S. | Davvaz, B.
Article Type: Research Article
Abstract: In recent years, the m-polar fuzziness structure has attracted the attention of researchers and has been commonly applied in algebraic structures. In this article, we present the notion of multi-polar fuzzy hyperideals of ordered semihyperrings, which is a generalization of the concept of bi-polar fuzzy hyperideals of ordered semihyperrings. We investigate some of their associated properties. Furthermore, we characterize regular ordered semihyperrings in terms of multi-polar fuzzy quasi-ideals and multi-polar fuzzy bi-ideals.
Keywords: Semihyperring, ordered semihyperring, m-polar fuzzy semihyperring, m-polar fuzzy hyperideals
DOI: 10.3233/JIFS-238654
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-09, 2024
Authors: Ameen, Zanyar A. | Mohammed, Ramadhan A. | Al-shami, Tareq M. | Asaad, Baravan A.
Article Type: Research Article
Abstract: This paper introduces a new fuzzy structure named “fuzzy primal.” It then studies the essential properties of fuzzy primals and discusses their basic operations. By applying the q-neighborhood system in a primal fuzzy topological space and the Łukasiewicz disjunction, we establish a fuzzy operator (·)⋄ on the family of all fuzzy sets, followed by its core characterizations. Next, we use (·)⋄ to investigate a further fuzzy operator denoted by Cl⋄. These fuzzy operators are then used to derive a new fuzzy topology from the existing one. Such a new fuzzy topology is called a primal fuzzy topology. Various properties of primal fuzzy topologies are found, among them the structure of a fuzzy base that generates a primal fuzzy topology. Furthermore, the concept of compatibility between fuzzy primals and fuzzy topologies is introduced, and some equivalent conditions to that concept are examined. It is shown that if a fuzzy primal is compatible with a fuzzy topology, then the fuzzy base that produces the primal fuzzy topology is itself a fuzzy topology.
Keywords: Fuzzy primal, fuzzy grill, fuzzy ideal, primal fuzzy topology, fuzzy ideal topology
DOI: 10.3233/JIFS-238408
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-10, 2024
Article Type: Research Article
Abstract: Background: Breast cancer diagnosis relies on accurate lesion segmentation in medical images. Automated computer-aided diagnosis reduces clinician workload and improves efficiency, but existing image segmentation methods face challenges in model performance and generalization. Objective: This study aims to develop a generative framework using a denoising diffusion model for efficient and accurate breast cancer lesion segmentation in medical images. Methods: We design a novel generative framework, PalScDiff, that leverages a denoising diffusion probabilistic model to reconstruct the label distribution for medical images, thereby enabling the sampling of diverse, plausible segmentation outcomes. Specifically, conditioned on the corresponding image, PalScDiff learns to estimate the mass-region probability through step-by-step denoising. Furthermore, we design a Progressive Augmentation Learning strategy to incrementally handle segmentation challenges of irregular and blurred tumors. Moreover, multi-round sampling is employed to achieve robust breast mass segmentation. Results: Our experimental results show that PalScDiff outperforms established models such as U-Net and transformer-based alternatives, achieving an accuracy of 95.15%, precision of 79.74%, Dice coefficient of 77.61%, and Intersection over Union (IOU) of 81.51%. Conclusion: The proposed model demonstrates promising capabilities for accurate and efficient computer-aided segmentation of breast cancer.
Keywords: Diffusion model, consistent regularization, breast cancer, medical image segmentation, data augmentation
DOI: 10.3233/JIFS-239703
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
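The Dice coefficient and Intersection over Union reported in the abstract above are standard overlap measures between a predicted mask and a ground-truth mask. The sketch below shows how they are commonly computed for a single pair of binary masks; the function name, the flat-list mask representation, and the empty-mask convention are assumptions for illustration, and the abstract does not say how the authors aggregate the metrics across images.

```python
def dice_and_iou(pred, target):
    """Dice coefficient and IoU for two binary masks given as flat 0/1 lists."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same length")

    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - intersection

    # Convention assumed here: two empty masks count as a perfect match.
    if pred_area + target_area == 0:
        return 1.0, 1.0

    dice = 2 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


# Hypothetical 4-pixel masks: one pixel overlaps out of three marked in total.
dice, iou = dice_and_iou([1, 1, 0, 0], [1, 0, 1, 0])
```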
Authors: Yang, Guang | Qi, Juntong | Wang, Mingming | Wu, Chong | Liu, Yansheng | Liu, Zhengjun | Ping, Yuan
Article Type: Research Article
Abstract: Target encirclement is widely used in the field of unmanned aerial vehicles (UAVs), where it can effectively monitor and intercept external threats. However, integrating target detection, localization, and final tracking is difficult or costly. This article proposes a complete and inexpensive framework for target encirclement with multiple quadrotors. The framework consists of three modules: object detection, target localization, and formation tracking. Firstly, a one-stage object detector based on a convolutional neural network is used to achieve fast and accurate object detection. Then, combined with the position and attitude states of the quadrotor, a 3D target localization scheme to locate the target position is proposed. Based on consensus theory, a time-varying formation tracking control protocol is proposed. Finally, a multiple quadrotor platform composed of one reconnaissance quadrotor and four hunter quadrotors is built with self-organizing network communication, which avoids the high cost of deploying object detection modules on each quadrotor platform. We deployed the framework on the multiple quadrotor platform and conducted static and dynamic localization and encirclement experiments with a minibus as the target. The results show that the reconnaissance quadrotor can detect and accurately locate targets at over 30 fps, and the average deviation in locating the target minibus could reach a minimum of 0.0712 m. The hunter quadrotors could track and encircle the moving target minibus in a time-varying formation. Experiments demonstrate the effectiveness and practicality of the proposed framework for target encirclement with multiple quadrotors.
Keywords: Multiple quadrotors, target encirclement, visual detection, target localization, time-varying formation tracking
DOI: 10.3233/JIFS-238335
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Ou, Qiqi | Zhang, Xiaohong | Wang, Jingqian
Article Type: Research Article
Abstract: Fuzzy rough sets (FRSs) play a significant role in the field of data analysis, and one of the common methods for constructing FRSs is the use of fuzzy logic operators. To further extend FRS theory to more diverse information backgrounds, this article proposes a covering variable precision fuzzy rough set model based on overlap functions and fuzzy β-neighbourhood operators (OCVPFRS). Some necessary properties of OCVPFRS are also studied in this work. Furthermore, multi-label classification is a prevalent task in the realm of machine learning. Each object (sample or instance) in multi-label data is associated with various labels (classes), and there are numerous features or attributes that need to be taken into account within the attribute space. To enhance various performance metrics in the multi-label classification task, attribute reduction is an essential pre-processing step. Therefore, motivated by the strong performance of overlap functions and fuzzy rough sets in applications such as image processing and multi-criteria decision-making, we establish an attribute reduction method suitable for multi-label data based on OCVPFRS. Through a series of experiments and comparative analysis with existing multi-label attribute reduction methods, the effectiveness and superiority of the proposed method have been verified.
Keywords: Fuzzy rough sets, overlap functions, fuzzy β-neighbourhood operators, attribute reduction, multi-label classification
DOI: 10.3233/JIFS-238245
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-19, 2024
Authors: Embriz-Islas, Cesar | Benavides-Alvarez, Cesar | Avilés-Cruz, Carlos | Zúñiga-López, Arturo | Ferreyra-Ramírez, Andrés | Rodríguez-Martínez, Eduardo
Article Type: Research Article
Abstract: Speech recognition with visual context is a technique that uses digital image processing to detect lip movements within the frames of a video to predict the words uttered by a speaker. Although models with excellent results already exist, most of them are focused on very controlled environments with few speaker interactions. In this work, a new implementation of a model based on Convolutional Neural Networks (CNN) is proposed, taking into account image frames and three models of audio usage through spectrograms. The results obtained are very encouraging in the field of automatic speech recognition.
Keywords: CNN, artificial intelligence, deep learning, speech recognition
DOI: 10.3233/JIFS-219346
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Zavala-Díaz, Jonathan | Olivares-Rojas, Juan C. | Gutiérrez-Gnecchi, José A. | Téllez-Anguiano, Adriana C. | Alcaraz-Chávez, J. Eduardo | Reyes-Archundia, Enrique
Article Type: Research Article
Abstract: Efficient medical information management is essential in today’s healthcare, particularly for automating diagnoses of chronic diseases. This study focuses on the automated identification of diabetic patients through a clinical note classification system. This innovative approach combines rules, information extraction, and machine learning algorithms to promise greater accuracy and adaptability. Initially, the four algorithms evaluated showed similar performance, with Gradient Boosting standing out with an accuracy of 0.999. They were tested on our clinical and oncology notes, where SVM excelled in correctly labeling non-oncology notes with a score of 0.99. Gradient Boosting had the best average with 0.966. The combination of rules, information extraction, and Random Forest provided the best average performance, significantly improving the classification of clinical notes and reducing the margin of error in identifying diabetic patients. The principal contribution of this research lies in the pioneering integration of rule-based methods, information extraction techniques, and machine learning algorithms for enhanced accuracy in diabetic patient identification. For future work, we consider implementing these algorithms in real clinical settings to evaluate their practical performance. In addition, further approaches will be explored to improve the accuracy and applicability of clinical note-grading systems in healthcare.
Keywords: NLP, diabetes, machine learning, binary classification, word frequency analysis
DOI: 10.3233/JIFS-219375
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Martinez, German | Duta, Eduard-Andrei | Sanchez-Romero, Jose-Luis | Jimeno-Morenilla, Antonio | Mora-Mora, Higinio
Article Type: Research Article
Abstract: Within various industrial settings, such as shipping, aeronautics, woodworking, and footwear, there exists a significant challenge: optimizing the extraction of sections from material sheets, a process known as “nesting”, to minimize wasted surface area. This paper investigates efficient solutions to complex nesting problems, emphasizing rapid computation over ultimate precision. We introduce a dual-approach methodology that couples both a greedy technique and a genetic algorithm. The genetic algorithm is instrumental in determining the optimal sequence for placing sections, ensuring each is located in its current best position. A specialized representation system is devised for both the sections and the material sheet, promoting streamlined computation and tangible results. By balancing speed and accuracy, this study offers robust solutions for real-world nesting challenges within a reduced computational timeframe.
Keywords: Genetic algorithm, 2D nesting, irregular pattern, cutting, industrial automation
DOI: 10.3233/JIFS-219345
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Ling, Lina | Wen, Mi | Wang, Haizhou | Zhu, Zhou | Meng, Xiangjie
Article Type: Research Article
Abstract: The detection of out-of-distribution (OoD) samples in semantic segmentation is crucial for autonomous driving, as deep learning models are typically trained under the assumption of a closed environment, whereas the real world presents an open and diverse set of scenarios. Existing methods employ uncertainty estimation, image reconstruction, and other techniques for OoD sample detection. We have observed that different classes may exhibit connections and associations in varying contexts. For example, objects encountered by autonomous vehicles differ in rural road scenes compared to urban environments, and the likelihood of encountering novel objects varies. This aspect is missing in current anomaly detection methods and is vital for OoD sample detection. Existing approaches solely consider the relative significance of each prediction class, overlooking the inter-object correlation. Although prediction scores (e.g., max logits) obtained from the segmentation network are applicable for OoD sample detection, the same problem persists, particularly for OoD objects. To address this issue, we propose the utilization of the Mahalanobis distance of max logits to evaluate the final predicted score. By calculating the Mahalanobis distance, the paper aims to uncover correlations between different classes, thus enhancing the effectiveness of OoD detection. To this end, we also extend the state-of-the-art segmentation model, DeepLabV3+, to enable OoD sample detection in this paper. Specifically, this paper proposes a novel backbone network, SOD-ResNet101, for extracting contextual and multi-scale semantic information, leveraging the class correlation feature of the Mahalanobis distance to enhance the detection performance of out-of-distribution objects. Notably, our approach eliminates the need for external datasets or separate network training, making it highly applicable to existing pretrained segmentation models.
Keywords: Semantic segmentation, deep learning, anomaly segmentation, automatic driving, out-of-distribution detection
DOI: 10.3233/JIFS-237799
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
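The abstract above relies on the Mahalanobis distance because, unlike a plain Euclidean distance, it accounts for correlations between dimensions through the covariance matrix. The following is a generic sketch of that distance for 2-dimensional score vectors only; it is not the authors' pipeline, and the abstract does not specify how the distance is applied to max logits. The function names and sample values are illustrative assumptions.

```python
import math


def mean_and_cov_2d(samples):
    """Sample mean and (unbiased) covariance of a list of 2-D points."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mx = sum(s[0] for s in samples) / n
    my = sum(s[1] for s in samples) / n
    sxx = sum((s[0] - mx) ** 2 for s in samples) / (n - 1)
    syy = sum((s[1] - my) ** 2 for s in samples) / (n - 1)
    sxy = sum((s[0] - mx) * (s[1] - my) for s in samples) / (n - 1)
    return (mx, my), ((sxx, sxy), (sxy, syy))


def mahalanobis_2d(x, mean, cov):
    """Mahalanobis distance of point x from a 2-D distribution (mean, cov)."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det == 0:
        raise ValueError("covariance matrix is singular")
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    # Quadratic form with the closed-form 2x2 inverse: (1/det) * [[d, -b], [-c, a]].
    q = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return math.sqrt(q)


# Hypothetical in-distribution scores for two classes, then one test vector.
mean, cov = mean_and_cov_2d([(0, 0), (2, 0), (0, 2), (2, 2)])
score = mahalanobis_2d((3, 3), mean, cov)
```

A larger distance means the test vector is less typical of the reference distribution, which is the intuition behind using it as an anomaly score.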
Authors: Kumar Sahu, Vinay | Pandey, Dhirendra | Singh, Priyanka | Haque Ansari, Md Shamsul | Khan, Asif | Varish, Naushad | Khan, Mohd Waris
Article Type: Research Article
Abstract: The Internet of Things (IoT) strategy enables physical objects to easily produce, receive, and exchange data. IoT devices are becoming more common in our daily lives, with diverse applications ranging from the consumer sector to industrial and commercial systems. The rapid expansion and widespread use of IoT devices highlight the critical significance of solid and effective cybersecurity standards across the device development life cycle. Consequently, an exploited vulnerability directly affects the IoT device and its applications. In this paper, we investigate and assess the various real-world critical IoT attacks/vulnerabilities that have affected IoT deployed in the commercial, industrial, and consumer sectors since 2010. Subsequently, we describe the vulnerabilities or type of attack, exploitation techniques, compromised security factors, intensity of vulnerability, and impacts of the expounded real-world attacks/vulnerabilities. We first categorise how each attack affects information security parameters, and then we provide a taxonomy based on the security factors that are affected. Next, we perform a risk assessment of the security parameters that are encountered, using two well-known multi-criteria decision-making (MCDM) techniques, namely the Fuzzy-Analytic Hierarchy Process (F-AHP) and the Fuzzy-Analytic Network Process (F-ANP), to determine the severity of the most heavily impacted information security measures.
Keywords: IoT attacks, fuzzy-ANP, fuzzy-AHP, MCDM, IoT vulnerabilities
DOI: 10.3233/JIFS-233759
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Chen, Jian | Cai, Zhiming | Peng, Sheng | Lu, Fei
Article Type: Research Article
Abstract: In the era of widespread connectivity, leveraging artificial intelligence models and analyzing the vast datasets generated by smart devices are central points in IoT research. While existing studies mainly focus on improving the decision-making prowess of central systems, the potential for local optimization remains largely unexplored. This paper presents an Ensemble Voting Scheme with Multilayer Dynamic Groups (EVMDS), which assigns decision weights to IoT devices based on their attribute data. By employing the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm, dynamic clusters among IoT devices can be identified, and ensemble voting rules are applied at each stage of group formation. This enables layered computations that ease the backend burden and provide hierarchical decision-making capability, facilitating regional-level decision-making that strikes a balance between local and global optimization. Through simulated decision-making scenarios in a small-scale IoT environment, our experiments demonstrate the superior accuracy and reliability of the proposed approach compared to existing models.
Keywords: Local optimization, Internet-of-things, ensemble-voting, DBSCAN, dynamic grouping
DOI: 10.3233/JIFS-236899
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-10, 2024
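The grouping step in the abstract above uses DBSCAN, which forms clusters from densely packed points and marks isolated points as noise without needing the number of clusters in advance. The sketch below is a minimal, unoptimized DBSCAN in plain Python; treating each device as a 2-D attribute vector, and the values of `eps` and `min_pts`, are assumptions for illustration rather than details from the paper.

```python
import math

NOISE = -1


def dbscan(points, eps, min_pts):
    """Minimal DBSCAN; returns one cluster label per point (-1 marks noise)."""
    labels = [None] * len(points)

    def neighbors(i):
        # Indices of all points within eps of point i (including i itself).
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE  # may later be relabeled as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point reached from a core point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nbrs = neighbors(j)
            if len(nbrs) >= min_pts:  # j is a core point: keep expanding
                queue.extend(nbrs)
    return labels


# Hypothetical device attribute vectors: two dense groups and one outlier.
devices = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (50, 50)]
labels = dbscan(devices, eps=1.5, min_pts=2)
```

Each resulting group could then hold its own vote before results are passed upward, which is the layered idea the abstract describes.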
Authors: Bochkarev, Vladimir V. | Savinkov, Andrey V. | Shevlyakova, Anna V. | Solovyev, Valery D.
Article Type: Research Article
Abstract: This work considers the implementation of a diachronic predictor of valence, arousal and dominance ratings of English words. The estimation of affective ratings is based on data on word co-occurrence statistics in the large diachronic Google Books Ngram corpus. Affective ratings from the NRC VAD dictionary are used as target values for training. When tested on synchronic data, the obtained Pearson’s correlation coefficients between human affective ratings and their machine ratings are 0.843, 0.779 and 0.792 for valence, arousal and dominance, respectively. We also provide a detailed analysis of the accuracy of the predictor on diachronic data. The main result of the work is the creation of a diachronic affective dictionary of English words. Several examples are considered that illustrate jumps in the time series of affective ratings when a word gains a new meaning. This indicates that changes in affective ratings can serve as markers of lexical-semantic changes.
Keywords: Affective words, affective norms, sentiment dictionary, word valence ratings, lexical semantic change
DOI: 10.3233/JIFS-219358
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
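The evaluation metric in the abstract above, Pearson's correlation coefficient, measures how closely predicted ratings track human ratings on a linear scale from -1 to 1. A minimal sketch of the computation follows; the function name and the sample ratings are made up for illustration and are not taken from the NRC VAD dictionary.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical human and machine valence ratings for five words.
human = [0.10, 0.35, 0.50, 0.80, 0.95]
machine = [0.20, 0.30, 0.55, 0.70, 0.90]
r = pearson(human, machine)
```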
Authors: Zhang, Yingmin | Yi, Afa | Li, Shuo
Article Type: Research Article
Abstract: The constant development and application of new technologies, such as big data, artificial intelligence and the mobile Internet, have profoundly changed the personal and professional spheres. Despite these advances, finance professionals are still faced with a multitude of routine, repetitive and error-prone tasks. At the same time, they are challenged by the shift to management accounting, resulting in reduced productivity. This paper addresses these issues by introducing a financial statement filing robot developed using Robotic Process Automation (RPA) technology. The application of this robot has been shown to provide superior efficiency and accuracy, reduce the heavy burden of routine tasks, and facilitate a smooth transition to management accounting practices. In addition, this research provides a valuable reference for the application and diffusion of RPA technology in the financial sector. Given the large amount of text data generated by financial processes, this paper proposes an automatic text categorization model. The effectiveness of the model is demonstrated in addressing the challenges encountered in the consultation and archiving process. This contribution informs the development of text categorization robots tailored to the needs of finance professionals.
Keywords: RPA technology, robot, financial statements, text classification, naive Bayes classifier model
DOI: 10.3233/JIFS-236716
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-10, 2024
Authors: Jun, Dai | Huijie, Shi | Yanqin, Li | Junwei, Zhao | Naohiko, Hanajima
Article Type: Research Article
Abstract: The cylinder liner is an internal part that plays an important role in the automobile internal combustion engine. Therefore, it is a top priority to accurately and quickly detect cylinder liner surface defects. To effectively classify and localize surface defects on the cylinder liner, this paper establishes a dataset of cylinder liner surface defects and proposes an improved YOLOv5-based algorithm for detecting them. Firstly, a machine vision system is established to acquire on-site images, which are manually annotated to build the dataset of surface defects on the cylinder liner. Secondly, the GSConv SlimNeck mechanism is introduced to reduce the model complexity; the Bi-directional Feature Pyramid Network (BiFPN) is used to fuse the feature information at different scales to enhance the detection accuracy of small surface defects on the cylinder liner; and the SimAM attention mechanism is embedded to focus on the object region of interest and improve the accuracy and robustness of the model. The final improved YOLOv5 model reduces the number of model parameters by 15.8% compared to the original YOLOv5. Experimental results on our self-built dataset of cylinder liner defects show that the mAP@0.5 is improved by 0.4%, which means that detection accuracy was not compromised. This method can be applied to actual production processes.
Keywords: Cylinder liner defect detection, YOLOv5, GSConv SlimNeck, BiFPN, SimAM
DOI: 10.3233/JIFS-237793
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Hu, Man | Sun, Dezhi | Bai, Yihan | Xiao, Han | You, Fucheng
Article Type: Research Article
Abstract: In the realm of graph representation learning, Graph Neural Networks (GNNs) have demonstrated exceptional efficacy across diverse tasks. Typically, GNNs employ message-passing schemes to disseminate node features along graph structures, culminating in learned graph representations. However, their heavy reliance on smoothed node features over graph structures, coupled with limited expressiveness in the presence of node attributes, often constrains link prediction performance. To surmount this challenge, we propose GTLP, a Graph Transformer-based link prediction framework. GTLP integrates unsupervised GNNs and structure encoding, enabling a holistic consideration of both topological structures and node features. This approach preserves critical node location and role information, enhancing the model’s expressiveness. By introducing the Graph Transformer model, GTLP adeptly incorporates neighbor information, refining embedding quality and bolstering the model’s learning and generalization capabilities. Notably, our method exhibits superior scalability, accommodating diverse techniques for information extraction, embedding learning, and sampling. Experimental results underscore GTLP’s state-of-the-art performance, outpacing various baselines across five real-world datasets.
Keywords: Deep learning, graph neural networks, graph transformer, link prediction
DOI: 10.3233/JIFS-237506
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Chen, Xinying | Hu, Mingjie
Article Type: Research Article
Abstract: With the rapid proliferation of substantial textual data from sources such as social media, online comments, and news articles, sentiment analysis has become increasingly crucial. However, existing deep learning methods have overlooked the significance of part-of-speech (POS) and emotional words in understanding the emotion of text. To address this, the paper proposes a sentiment analysis approach that combines multiple features with a dual-channel network. Firstly, the vector representation of the text is obtained through the Robustly Optimized BERT Pretraining Approach (RoBERTa). Secondly, the POS features and word emotional features are separately updated using self-attention to calculate weights. After the word, POS, and emotion features are concatenated, feature dimension reduction and fusion are achieved through a linear layer. Finally, the fused feature vector is input into a dual-channel network composed of a Bidirectional Gated Recurrent Unit (BiGRU) and a Deep Pyramid Convolutional Neural Network (DPCNN). Experimental results demonstrate that the proposed method achieves higher classification accuracy than the comparative methods on three sentiment analysis datasets, validating the effectiveness of the proposed approach.
Keywords: Sentiment analysis, part-of-speech, RoBERTa, bidirectional gated recurrent unit, deep pyramid convolutional neural network
DOI: 10.3233/JIFS-237749
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Nisha, B. Muthu | Selvakumar, J. | Nithya, V.
Article Type: Research Article
Abstract: This research supports the provision of secure and sustainable energy services and contributes to the advancement of technology aligned with the Sustainable Development Goals (SDGs). The motivation behind this study stems from the critical need to bolster hardware security within cutting-edge smart grid infrastructure, and more specifically, for smart energy metering technology. To address this need, this paper introduces a feasible and modular approach for enhancing security through the implementation of a cryptographic key generator. This key generator is based on a modified Delay-based Physically Unclonable Function (PUF), which incorporates the innovative concept of a Delay Locked Loop (DLL). The reliability of the proposed PUFs has been rigorously assessed, demonstrating performance levels of 98.02% and 99.1% across a wide temperature and supply-voltage range, spanning -40°C to 80°C and 3.0-3.6 V. This demonstrates dependable functionality within the smart meter’s operational parameters. The effectiveness of this approach is confirmed through practical testing conducted on the ZYNQ-7 ZC 702 Field-Programmable Gate Array (FPGA) platform. The outcomes are encouraging, with uniqueness of 55.96% and 56.2% and uniformity of 51.2% and 49.15%. This research advances the state of the art by surpassing previous investigations into XOR Arbiter PUF (XOR APUF) and Configurable Ring Oscillator PUF (CRO PUF) designs. Furthermore, the paper examines the proposed design’s resilience against modeling attacks and presents comprehensive security assessments.
Keywords: Sustainable development goals, smart energy meter, delay locked loop, physically unclonable function, field programmable gate array
DOI: 10.3233/JIFS-240099
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Gowri, S. | Vennila, B. | Antony Crispin Sweety, C.
Article Type: Research Article
Abstract: The primary focus of this work is to develop the concept of bipolar N-neutrosophic supra topological spaces. Concepts such as the closure and interior operators of N-neutrosophic supra topological spaces are extended to bipolar N-neutrosophic supra topological spaces. The properties of, and relationships between, weak forms of bipolar N-neutrosophic supra topological open sets are also established. Further, several separations amongst bipolar N-neutrosophic supra sets are suggested. A distance between bipolar N-neutrosophic sets is introduced, and an efficient approach for group multi-criteria decision making based on bipolar N-neutrosophic sets is proposed.
Keywords: Bipolar N-neutrosophic supra topology, bipolar N-neutrosophic supra α-open set, bipolar N-neutrosophic supra semi-open, bipolar N-neutrosophic supra β-open and bipolar N-neutrosophic supra pre-open, N-valued interval neutrosophic sets
DOI: 10.3233/JIFS-224450
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Vallejos, Sebastian | Armentano, Marcelo G. | Berdun, Luis | Schiaffino, Silvia | González Císaro, Sandra | Nigro, Oscar | Balduzzi, Leonardo | Cuesta, Ignacio
Article Type: Research Article
Abstract: Product classification is a critical task for the smooth running of the purchase process in e-commerce websites. In P2P marketplaces, users can act both as sellers and as buyers, and they need to assign predefined categories to the products they want to sell. Besides being tedious for users, this task can result in ambiguous or inaccurate assignments. This article presents a method for the automatic categorization of items offered in a local P2P marketplace using a multi-level classification approach. Our experiments demonstrated a significant improvement in the classification results of the proposed solution compared to a traditional direct classification approach.
Keywords: Classification, e-commerce, NLP, P2P marketplace
DOI: 10.3233/JIFS-219344
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Brännström, Andreas | Nieves, Juan Carlos
Article Type: Research Article
Abstract: This paper introduces an automated decision-making framework for providing controlled agent behavior in systems dealing with human behavior change. Controlled behavior in such settings is important in order to reduce unexpected side effects of a system’s actions. The general structure of the framework is based on a psychological theory, the Theory of Planned Behavior (TPB), which captures the causes of human motivational states and thereby enables reasoning about the dynamics of human motivation. The framework consists of two main components: 1) an ontological knowledge base that models an individual’s behavioral challenges to infer motivational states and 2) a transition system that, in a given motivational state, decides on motivational support, resulting in transitions between motivational states. The system generates plans (sequences of actions) for an agent to facilitate behavior change. A particular use case is modeled regarding children with Autism Spectrum Conditions (ASC), who commonly experience difficulties in everyday social situations. An evaluation of a proof-of-concept prototype shows consistency between ASC experts’ suggestions and the plans generated by the system.
Keywords: Interactive agents, strategic decision-making, behavior-change systems, theory of planned behavior, Autism
DOI: 10.3233/JIFS-219335
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Li, Fuxue | Chi, Chuncheng | Yan, Hong | Zhang, Zhen | Zhao, Zhongchao
Article Type: Research Article
Abstract: Transformer-based neural machine translation (NMT) models have achieved state-of-the-art performance in machine translation. These models automatically learn translation knowledge from the bilingual corpus through the attention mechanism. This differs from the way human translators approach sentence translation, where prior knowledge plays a significant role. Inspired by this, a word translation augmentation (WTA) method is proposed to improve the Transformer-based NMT model. The main steps are as follows: first, word alignment rules are constructed from the training set. Next, translation rules for source words are generated according to the word alignment rules. Lastly, the potential translation candidates for each source word are incorporated into the NMT model during training and testing. In addition, the WTA method introduces the idea of Mixup for the translation candidates of a source word and employs two augmentation strategies to augment the encoder. Experiments on several high-resource and low-resource translation tasks indicate the effectiveness of the proposed method compared with the corresponding strong baseline, with BLEU score improvements ranging from 0.42 to 0.63.
Keywords: Neural machine translation, transformer, word embedding, word translations
DOI: 10.3233/JIFS-236170
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Jia, Liu
Article Type: Research Article
Abstract: This study explores a predictive approach using a combination of a one-dimensional convolutional neural network and support vector machine to enhance the management of cultural product trade between China and South Korea, addressing the trade deficit challenge. The methodology involves the collection and categorization of diverse data related to the trade of cultural products between the two countries, identifying data mining directions. The research incorporates the design of association rule functions to identify viable data sources, and employs a hybrid data clustering algorithm integrating technology and spectral clustering to cluster available data. The features extracted from the data mining process are utilized as learning samples for trade prediction. Both a one-dimensional convolutional neural network and support vector machine are employed to model and predict cultural product trade between China and South Korea. Experimental results demonstrate the method’s accuracy in predicting trade situations under parameterized conditions. Throughout the prediction process, credibility measurement values and controllable correlation degrees consistently exceed 19 and 12.5, respectively, while uncertainty discrimination degrees and error coefficients remain below 12 and 6.
Keywords: Big data integration, Chinese and Korean cultural products, trade prediction, data mining, convolutional neural network, support vector machine
DOI: 10.3233/JIFS-238061
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: López-López, Aurelio | García-Gorrostieta, Jesús Miguel | González-López, Samuel
Article Type: Research Article
Abstract: Emotion detection in educational dialogues, particularly within student-teacher interactions, has become a crucial research area for improving the learning experience. In this paper, we employ two models, a generic Bidirectional Encoder Representations from Transformers (BERT) model and the emotion detection model Robustly Optimized BERT Approach (EmoRoBERTa), to automatically classify emotions in a corpus of student-teacher chat interactions. We then validate these classifications using a scheme based on oracles, employing two generative large language models (ChatGPT and Bard). Experiments on emotion detection in dialogues between students and teachers revealed that EmoRoBERTa exhibited a reasonable level of agreement with the oracles, while ChatGPT demonstrated the highest consistency with EmoRoBERTa’s predictions. Furthermore, we identified the impact of specific words on emotion classification, offering insights into the decision-making process of these models. The results not only highlight the prominent presence of emotions like approval, gratitude, curiosity, disapproval, amusement, confusion, remorse, joy, and surprise but also provide substantial support for the utilization of the proposed emotion detection model to enhance the student learning environment. Exploring the emotional aspects of educational dialogues holds the potential to enhance instruction methods, provide timely assistance to students in need, and create an improved learning atmosphere.
Keywords: Emotion detection, learning interaction, transfer learning, large language models, active learning
DOI: 10.3233/JIFS-219340
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Ratha, Ashoka Kumar | Behera, Santi Kumari | Devi, A. Geetha | Barpanda, Nalini Kanta | Sethy, Prabira Kumar
Article Type: Research Article
Abstract: With the rise of the fruit processing industry, machine learning and image processing have become necessary for quality control and monitoring of fruits. Recently, strong vision-based solutions have emerged in farming industries that make inspections more accurate at a much lower cost. Advanced deep learning methods play a key role in these solutions. In this study, we built an image-based framework that uses the ResNet-101 CNN model to identify different types of papaya fruit diseases with minimal training data and processing power. A case study identifying commonly encountered papaya fruit diseases during harvesting was used to support the results of the suggested methodology. A total of 983 images of both healthy and defective papaya were considered during the experiment. We initially used the ResNet-101 CNN model for classification and then combined the deep features extracted from the activation layer (fc1000) of the ResNet-101 CNN with a multi-class Support Vector Machine (SVM) to classify papaya fruit defects. After comparing the performance of both approaches, it was found that Cubic SVM is the best classifier using the deep features of ResNet-101 CNN, achieving an accuracy of 99.5% and an area under the curve (AUC) of 1 without any classification error. The findings of this experiment reveal that the ResNet-101 CNN with the cubic SVM model can categorize good, diseased, and defective papaya images. Moreover, the suggested model performed better in terms of the F1-score (0.99), sensitivity (99.50%), and precision (99.71%). The present work not only assists the end user in determining the type of disease but also makes it possible for them to take corrective measures during farming.
Keywords: Disease classification, CNN (Convolutional Neural Network), ResNet-101, ML (Machine Learning), SVM (Support Vector Machine)
DOI: 10.3233/JIFS-239875
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-17, 2024
Authors: Shi, Xiaolong | Kosari, Saeed | Rangasamy, Parvathi | Nivedhaa, R.K. | Rashmanlou, Hossein
Article Type: Research Article
Abstract: Modern image processing techniques are moving beyond older methods and include advanced approaches such as deep learning. Convolutional Neural Networks (CNNs) are excellent at automatic feature extraction, whereas Generative Adversarial Networks (GANs) produce realistic images. Transfer learning uses pre-trained models, whereas semantic segmentation identifies pixels in images. Super-resolution, style transfer, and attention mechanisms can increase the quality and understanding of images. Adversarial defenses address purposeful manipulations, while 3D image processing handles three-dimensional data. These advancements make use of improved computational power and massive datasets to revolutionize image processing capabilities. Traditional image processing algorithms frequently fail to handle the complex and multidimensional structure of color images, particularly when dealing with uncertainty and imprecision. In this study, the 3D-EIFIM framework is extended, and scaled aggregation operations on 3D-EIFIM tailored for image data are proposed. Each pixel is represented as an entry of a 3D-EIFIM, and aggregation techniques are applied to enable more effective image analysis, manipulation, and enhancement. The practical implications of this research are significant, as it can lead to advancements in fields such as computer vision, medical imaging, and remote sensing.
Keywords: IFP, conjunction, disjunction, IFIM, EIFIM, 3D-IFIM, 3D-EIFIM
DOI: 10.3233/JIFS-238252
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-17, 2024
Authors: Ruby Elizabeth, J. | Kesavaraja, D. | Ebenezer Juliet, S.
Article Type: Research Article
Abstract: Glaucoma is a retinal illness that frequently causes vision loss worldwide. Hence, early detection of glaucoma is important. In this article, a modified AlexNet deep learning model is proposed to categorize source retinal images as either healthy or glaucoma-affected through the detection and segmentation of the optic disc (OD) and optic cup (OC) regions in retinal images. Performance measures are estimated against ground truth images in terms of accuracy, specificity, and sensitivity, and are compared with previous glaucoma detection techniques on the publicly accessible retinal image datasets HRF and RIGA. AIM: Segmenting the OD and OC areas and classifying the source retinal image as either healthy or glaucoma-affected. METHODS: The retinal images are preprocessed, and the OD region is detected and segmented using a circulatory filter. Further, the OC region is detected and segmented using the K-means classification algorithm. The segmented OD and OC regions are then used to train the suggested AlexNet deep learning model, which classifies them. RESULTS: The suggested method achieves 91.6% GDR for mild cases and 100% GDR for severe cases on the HRF dataset, and 97.7% GDR for mild cases and 100% GDR for severe cases on the RIGA dataset. CONCLUSION: This article proposes a modified AlexNet deep learning model for the detection of glaucoma using retinal images. The detected OD and OC regions are utilized to classify the retinal images as either healthy or glaucoma-affected. The proposed method obtains 100% Sey, 93.7% Spy and 96.6% CA on HRF dataset retinal images, and 97.7% Sey, 98% Spy and 97.8% CA on RIGA dataset retinal images.
Keywords: Retina, deep learning, OD, OC, AlexNet
DOI: 10.3233/JIFS-234131
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Liu, Kai | Wang, Mingyi
Article Type: Research Article
Abstract: China has emerged as one of the nations with the worst air pollution in recent years. The severe air pollution has caused large-scale population migration and serious economic problems. Since the concentration of air pollutants can change quickly in a short amount of time, the study first tracked PM2.5, PM10, NO2, CO, SO2, and O3 as targets before using the particle swarm optimization algorithm to improve the PIO algorithm, which is based on the traditional pigeon swarm algorithm. To estimate the concentration of air pollutants, the wavelet packet decomposition technique, the MDS visualization method, and the k-means algorithm are combined. The enhanced PIO algorithm is then applied to optimize the ELM algorithm. Finally, a new type of decomposition-optimization-clustering-integration hybrid learning model, namely the DOCIAPC model, is constructed. The experimental findings indicate that, when predicting the concentration of various air pollutants, the DOCIAPC model’s average direction prediction accuracy is 90.37%. In conclusion, the model suggested in the study has excellent performance and applicability. It can accurately predict the concentration of air pollutants and help the government take action to reduce air pollution, balance the environment and the economy, and allocate labor and resources in the city.
Keywords: Air pollution, wavelet packet decomposition, pigeon group algorithm, K-means algorithm, MDS, labor force
DOI: 10.3233/JIFS-235902
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Wang, Lu
Article Type: Research Article
Abstract: In today’s technology-driven world, education is becoming one of the basic necessities of human life, like food, shelter, and clothing. Even day-to-day activities are moving toward automated processes built on technological developments such as smartphones, internet services, and home and office appliances. To cope with these technologies, people need a basic educational qualification to understand and operate such appliances easily. Apart from this, education helps people grow in both knowledge and wealth. With the development of technology, different Artificial Intelligence techniques have been applied to datasets to analyze the factors that affect education and to enhance teaching methods. However, current techniques have been applied to only one or two data models, analyzing either educational performance or demographic variables, and these models are not sufficient for analyzing all the factors that affect education. To overcome this, a single optimized machine-learning approach is proposed in this paper to analyze the factors that affect education. This analysis helps faculty enhance their teaching methodology and understand students’ attitudes toward education. The proposed hybrid cuckoo search-particle swarm optimization was implemented on three datasets to determine the factors that affect education. These optimal factors are determined by identifying their relations to the final results of an individual person. All the optimal factors are combined and grades are grouped to analyze the performance of the proposed optimization process using a regression neural network. The proposed optimization-based neural network was tested on three data models, and its performance analysis showed that the proposed model can achieve an accuracy of 99% in analyzing the factors that affect an individual’s education. This shows that the proposed model can help faculty give more individual attention to students.
Keywords: Education, demographic factors, optimization, hybrid, cuckoo search optimization, particle swarm, regression neural network
DOI: 10.3233/JIFS-234021
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Ramasamy, Uma | Santhoshkumar, Sundar
Article Type: Research Article
Abstract: In the expansive domain of data-driven research, the curse of dimensionality poses challenges such as increased computational complexity, noise sensitivity, and the risk of overfitting models. Dimensionality reduction is vital to handle high-dimensional datasets effectively. The pilot study disease dataset (PSD) with 53 features contains patients with Rheumatoid Arthritis (RA) and Osteoarthritis (OA). Our work aims to reduce the dimension of the features in the PSD dataset, identify a suitable feature selection technique for the reduced-dimensional dataset, analyze an appropriate Machine Learning (ML) model, and reveal the significant features that predict RA and OA. The proposed study, Progressive Feature Reduction with Varied Missing Data (PFRVMD), was employed to reduce the dimension of features by using PCA loading scores in the random value imputed PSD dataset. Subsequently, notable feature selection methods, such as backward feature selection, the Boruta algorithm, the extra tree classifier, and forward feature selection, were implemented on the reduced-dimensional feature set. The significant features/biomarkers are obtained from the best feature selection technique. ML models such as the K-Nearest Neighbour Classifier (KNNC), Linear Discriminant Analysis (LDA), Logistic Regression (LR), Naïve Bayes Classifier (NBC), Random Forest Classifier (RFC) and Support Vector Classifier (SVC) are used to determine the best feature selection method. The results indicated that the Extra Tree Classifier (ETC) is the most promising feature selection method for the PSD dataset because the significant features obtained from ETC yielded the highest accuracy on SVC.
Keywords: Autoimmune disease, rheumatoid arthritis, osteoarthritis, feature reduction, feature selection, machine learning algorithms
DOI: 10.3233/JIFS-231537
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Elsabagh, M.A. | Emam, O.E. | Medhat, T. | Gafar, M.G.
Article Type: Research Article
Abstract: By anticipating system defect-prone units, software-developing businesses aim to increase the quality of software. Despite the development of numerous Data Mining (DM) and Artificial Intelligence (AI) techniques in the Software Defect Prediction (SDP) field, dealing with the uncertainty of datasets persists due to noise, data distribution, class overlapping, proposed model parameters, and old data. This uncertainty issue has a negative impact on the accuracy of software defect prediction. To overcome this limitation, a model-based hybridization of Ant Colony Optimization-inspired Fuzzy Rough Feature Selection (FRAC) followed by adapting the parameters of Adaptive Neuro-Fuzzy Inference System (ANFIS) with a novel algorithm called Turbulent Flow of Water Optimization (TFWO) is recommended. The proposed model (FRAC+TFWANFIS) performed better than contemporary literature and other optimization algorithms in SDP, such as Ant Colony Optimization (ACO), Differential Evolution (DE), ANFIS, Grey Wolf Optimization (GWO), Particle Swarm Optimization (PSO), and Genetic Algorithm (GA). Also, the performance of the proposed model is superior to that of other conventional classification techniques such as Naïve Bayes (NB), Logistic Regression (LR), Multilayer Perceptron (MLP), Support Vector Machine (SVM), Fuzzy Rough Nearest Neighbor (FRNN), Fuzzy Nearest Neighbor (FNN), Bagging, C4.5, Random Forest (RF), and K-Nearest Neighbor (K-NN). Two datasets, PC3 and PC4, with large dimensions from the OPENML platform are used. The experiments are evaluated with regard to accuracy, Standard Deviation (SD), Root Mean Square Error (RMSE), Mean Square Error (MSE), and other measurement metrics. The uncertainty issue is addressed by the (FRAC+TFWANFIS) model with accuracy of 90.8% and 91.1% for PC3 and PC4, respectively.
Keywords: Adaptive Neuro-Fuzzy Inference System (ANFIS), Turbulent Flow of Water Optimization Algorithm, Software Defect Prediction (SDP), Recent and Conventional Optimization Algorithms, Uncertainty of SDP.
DOI: 10.3233/JIFS-234415
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-21, 2024
Authors: Sun, Yilin | Li, Shufan
Article Type: Research Article
Abstract: Contemporary art design not only pursues the quality of the work itself, but also pays attention to the sensory aspects of people’s needs for art design. Traditional art design methods can be limited by time, space and other objective conditions, often fail to achieve the designer’s expected effect, and give visitors a weak experience. The use of multimedia technology in art and design can enrich its expression and enhance visitors’ experience. In order to increase the sense of interaction between the platform and users, multimedia technology is incorporated into the interactive art design platform generated by VR technology in this paper. This article combines multimedia technology with interactive technology to construct an interactive platform for art and design, and applies it to the display of and interaction with Dunhuang murals. Through the analysis of user experience feedback, the effectiveness of art and design display and interaction is verified. The test extracts women’s clothing colors from the same tradition across different periods in the color extraction exploration module of the interactive platform, so as to provide accurate information for displaying changes in women’s clothing color and for comparative interaction. The findings show that the platform is capable of extracting and recognizing the color characteristics of the murals, accurately identifying user signals, and achieving 3D modeling of images via VR technology. This capability provides solid technical and data support for the platform’s interaction module. According to the evaluation of trial user information, the interaction design, platform functionality, and layout can support the majority of users in terms of cognition, perception, and interaction, pique their interest, and enhance their experience. A small percentage of users reported that the interaction ends abruptly and that they had a bad experience overall.
Keywords: Multimedia technology, art and design, interactive, platform building
DOI: 10.3233/JIFS-238001
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Sheik Faritha Begum, S. | Suresh Anand, M. | Pramila, P.V. | Indra, J. | Samson Isaac, J. | Alagappan, Chockalingam | Gopala Gupta, Amara S.A.L.G. | Srivastava, Suraj | Vidhya, R.G.
Article Type: Research Article
Abstract: Thyroid tumours are a common form of cancer, and accurate classification of their type is crucial for effective treatment planning. This research presents a hybrid approach for the classification of thyroid tumours based on their type. The proposed approach combines the use of advanced machine learning techniques with a comprehensive database of thyroid tumour samples. The database includes various features such as tumour size, shape, and texture, as well as patient-specific information. The hybrid approach aims to optimize the classification process by leveraging the diverse set of features and utilizing the power of machine learning algorithms. By harnessing the power of machine learning algorithms, this approach has the potential to revolutionize the field of thyroid tumour classification and significantly improve patient outcomes. The optimization strategy is Particle Swarm Optimization, refining the classification performance and ensuring optimal accuracy in identifying and categorizing four types of thyroid tumours. The utilization of advanced diagnostic tools and state-of-the-art Random Forest classifier techniques in this approach marks a significant advancement in the field of thyroid tumour classification. Through the augmentation of the dataset and the pre-processing techniques employed, the hybrid classification system demonstrates enhanced accuracy and reliability in distinguishing between different types of thyroid tumours. This innovative approach not only provides a more comprehensive understanding of thyroid tumours but also paves the way for personalized and effective treatment strategies, ultimately improving patient care and outcomes.
Keywords: Machine learning, thyroid tumours, Particle Swarm Optimization, Random Forest classifier, innovative approach
DOI: 10.3233/JIFS-239804
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Hou, Junjian | Zhang, Bingyu | Zhong, Yudong | Zhao, Dengfeng | He, Wenbin | Zhou, Fang
Article Type: Research Article
Abstract: Online monitoring of cutting tool wear is an important component of advanced manufacturing technology, which can greatly improve processing efficiency and reduce production cost. In this paper, a cutting tool wear state prediction method based on acoustic imaging recognition is developed. By applying the advantages of the functional generalized inverse beamforming method in sound field reconstruction, the acoustic signal is used as the carrier to reconstruct the three-dimensional radiated sound field. The reconstructed sound field image is then sliced and input into a convolutional neural network model as a sample, which processes and classifies the image and mines the state-related feature information from the sound field image. By incorporating amplitude and phase information of the sound field, the presented method utilizes spatial domain mapping to accurately identify the noise source and address challenges such as low recognition rate and difficult diagnosis under weak fault conditions. Based on these theories, the recognition of sound field states is achieved through a simulated fault experiment conducted on a sound box, thereby validating the feasibility of the state monitoring method based on pattern recognition of sound and image. Finally, a four-edge carbide milling cutter is selected as the experimental object, and the cutting tool wear state is monitored by integrating sound field reconstruction techniques with convolution feature extraction methods to validate the robustness of the proposed approach.
Keywords: Functional generalized inverse beamforming, convolutional neural network, sound field reconstruction, state detection, acoustic imaging technology
DOI: 10.3233/JIFS-238755
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-19, 2024
Authors: Zhang, Jianwei | Chen, Lei | Hou, Ge | Huang, Jinlin | Wang, Yong
Article Type: Research Article
Abstract: Health assessment is one of the important theoretical bases for deciding whether the diversion tunnel can operate safely and stably. A project of the TBM diversion tunnel is taken as the research object to ensure the normal operation of the diversion tunnel. Based on measured data and considering multiple safety aspects such as structural response, durability, and external factors of the diversion tunnel, a TBM diversion tunnel structural health evaluation index system is established. A new method for the TBM diversion tunnel structural health comprehensive evaluation based on Analytic Hierarchy Process-Matter Element Extension-Variable Weight Theory (AMV) is proposed to explore the impact of AMV fluctuation with the measured results of the indicators on the weight, closeness, and health grade of each evaluation index. The high sensitivity and high-risk evaluation indicators for the structural health of the diversion tunnels are identified. It is found that the variable weight varies with the changes in various indicator values, which can accurately evaluate the health status of tunnels in real-time. The characteristic values of the tunnel grade calculated by the AHP and the AMV are 1.589 and 1.695, respectively. The results of the corresponding interval diversion tunnel are the basic safety state of grade B. Except for the two evaluation indicators of concrete strength and slurry properties, the variable weight values and grade characteristic values of other evaluation indicators increase with the increase of indicator values. The four indicators of segment settlement, segment opening, segment misalignment, and segment cracks are more sensitive to the health of the TBM diversion tunnel. This AMV can accurately evaluate the health status of the diversion tunnel structure. The research results can provide references for later maintenance work and similar projects.
Keywords: Diversion tunnel, Health evaluation, AMV, AHP, susceptibility
DOI: 10.3233/JIFS-239155
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Li, Yuerong | Zhang, Yuhua | Che, Jinxing
Article Type: Research Article
Abstract: Accurate prediction of the short-term electricity price is key to obtaining economic benefit and is also an important index for power system planning and management. Ensemble methods based on support vector regression (SVR) have achieved high accuracy and steady performance, but they depend heavily on data representativeness and have a high computational complexity of O(k·N^3) with respect to data samples and parameter selection. To further improve data representativeness and reduce computational complexity, this paper develops a new approach that forecasts the electricity price via an optimal weighted ensemble. In the model, a cluster-based subsampling algorithm is proposed to categorize the seasonally decomposed inputs into several groups, and representative data are drawn from each group in a certain proportion so that each subset trained with SVR has the same representativeness and features. Moreover, an optimal weighted combination method is presented to assign weights to the sub-SVRs and obtain the optimal weighted support vector regression ensemble model (OWSSVRE). The experimental results show that the improved SVR ensemble model, whose subsets share the same features and representativeness, performs better in electricity price forecasting. It is therefore suitable for supporting decision making in the energy sector and other sectors.
Keywords: Electricity price forecasting, support vector regression, K-means clustering, optimal weight, subsampling
DOI: 10.3233/JIFS-236239
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-16, 2024
Authors: Thenmozhi, R. | Sakthivel, P. | Kulothungan, K.
Article Type: Research Article
Abstract: The Internet of Things and Quantum Computing raise concerns, as Quantum IoT defines security that exploits quantum security management in IoT. The security of IoT is a significant concern: communications must be appropriately protected to address key distribution challenges and ensure high security during data transmission. In the critical context of IoT environments, secure data aggregation can therefore provide access privileges for network services. Most data aggregation schemes achieve high computational efficiency; however, the cryptography mechanism faces challenges in finding a solution for the expected security breaches, especially with the advent of quantum computers utilizing public-key cryptosystems despite these limitations. In this paper, the Secure Data Aggregation using Quantum Key Management scheme, named SDA-QKM, employs public-key encryption to enhance the security level of data aggregation. The proposed system introduces traceability and stability checks for the keys to detect adversaries during the data aggregation process, providing efficient security and reducing authentication costs. The performance is evaluated by comparing SDA-QKM with existing competing data aggregation schemes. The results demonstrate that SDA-QKM is robust against various threats in the security analysis and protects privacy, authentication, and computation efficiency at a lower computational cost and communication overhead than existing systems.
Keywords: Internet of things, security, data aggregation, access control, quantum cryptography
DOI: 10.3233/JIFS-223619
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-16, 2024
Authors: Li, Chen | Liu, Na | Xu, Zhenshun | Zheng, Guofeng | Yang, Jie | Dao, Lu
Article Type: Research Article
Abstract: Medical short text classification is of great significance to medical information extraction and medical auxiliary diagnosis. However, medical short texts face challenges such as sparse features, semantic ambiguity, and the specialized nature of the medical field, resulting in relatively low classification accuracy. Taking the characteristics of medical short texts into consideration, this paper proposes a Chinese medical short text classification model based on DPECNN. First, ERNIE is used to learn text knowledge and information in order to enhance the model’s semantic representation capabilities. Then, the DPECNN model is employed to extract rich feature information, and the classification results are generated through a fully connected layer. DPCNN on its own considers only deep-level contextual semantic information and overlooks the correlation of adjacent semantic information between channels. To address this, ECA channel attention is introduced to account for adjacent semantic information. A self-normalizing activation function helps avoid the vanishing gradient problem. To enhance the model’s robustness and generalization ability, the FGM adversarial training algorithm is employed to perturb the data. The F1 values achieved on the THUCNews, KUAKE-QIC, and CHIP-CTC datasets are 95.00%, 79.45%, and 82.81%, respectively.
Keywords: Medical text mining, Chinese short text classification, ERNIE, DPECNN, confrontation training
DOI: 10.3233/JIFS-239006
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Du, Rong | Cheng, Yan
Article Type: Research Article
Abstract: This paper highlights the significance of vehicle detection in aerial images for surveillance systems, focusing on deep learning methods that outperform traditional approaches. However, the high computational complexity caused by diverse vehicle appearances remains a challenge. To address this challenge, a lightweight deep neural network-based model is developed that balances accuracy and efficiency and enables real-time operation. The model is trained and evaluated on a standardized dataset, with extensive experiments demonstrating its ability to achieve accurate vehicle detection at significantly reduced computation cost, offering a practical solution for real-world aerial surveillance scenarios.
Keywords: Aerial images, vehicle detection, surveillance system, deep learning, real-time processing
DOI: 10.3233/JIFS-236059
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Pavithra, R. | Ramachandran, Prakash
Article Type: Research Article
Abstract: Hilbert spectrum images of the intrinsic mode functions (IMFs) obtained from empirical mode decomposition (EMD) and variational mode decomposition (VMD) of faulty machine vibration signals are fed to a deep convolutional neural network (DCNN) for machine fault classification; the DCNN automatically learns features from the spectral images through its convolution layers. Although both EMD and VMD are well suited to non-stationary signal analysis, VMD has the merit of producing aliasing-free IMFs. This paper brings out the performance improvement that VMD provides for DCNN classification of a non-stationary vibration signal dataset. The numerical experiment uses the Hilbert spectrum images of 4 EMD-IMFs and 4 VMD-IMFs in the DCNN to classify 10 different faults of the Case Western Reserve University (CWRU) bearing dataset. The confusion matrices are obtained, and the plot of model accuracy against epochs for the DCNN is analysed. It is shown that the spectrum images of one of the four EMD-IMFs, IMF0, give a validation accuracy of 100%, whereas for VMD the spectrum images of two of the four VMD-IMFs, IMF0 and IMF1, give a validation accuracy of 100%. This reveals that the non-aliasing IMFs of VMD are better at classifying bearing faults. To further bring out the merits of VMD analysis for non-stationary signals, a numerical experiment is conducted using VMD analysis for binary fault classification of the milling dataset, which is more non-stationary than the bearing dataset, as shown by plotting the statistical parameters of both datasets against time. It is found that the DCNN classification is 100% accurate for IMF3 of the VMD analysis, which is much better than the 81% accuracy provided by EMD analysis in the existing literature. The performance comparison highlights the merits of VMD analysis over EMD analysis, other state-of-the-art methods, and ensemble learning methods.
Keywords: Deep convolution neural network, empirical mode decomposition, hilbert transform, intrinsic mode function, variational mode decomposition, ensemble learning
DOI: 10.3233/JIFS-237546
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-19, 2024
Authors: Nawshin, Sabila | Islam, Salekul | Shatabda, Swakkhar
Article Type: Research Article
Abstract: Software Defined Networking (SDN) proposes a centralized network paradigm in which a central controller manages the network. While this centralized scheme opens up previously unachievable opportunities, it also makes the network more susceptible to a wide range of cyber threats. Effective Intrusion Detection Systems (IDS) designed for the SDN topology are critically needed to address the different vulnerabilities SDN faces. To that end, the inSDN dataset was specifically curated for intrusion detection in SDN, with various attack scenarios unique to the SDN topology. This study uses the inSDN dataset to introduce an IDS model that applies Principal Component Analysis (PCA), a dimensionality reduction technique widely employed in traditional Machine Learning (ML), to extract the principal features of the dataset and couples it with Artificial Neural Networks (ANN) to classify network traffic based on the extracted features. The proposed model attains an accuracy of 99.95% for multi-class classification and demonstrates that it surpasses the current state-of-the-art techniques while operating within a much simpler framework. This significantly reduces the need for complex models that demand extensive computational resources when dealing with the inSDN attack dataset. The analysis of the dataset carried out in this study also provides insights into the redundancy present in the dataset and identifies the core features that contain most of its information.
Keywords: Software Defined Networking (SDN), Intrusion Detection Systems (IDS), Principal Component Analysis (PCA), Artificial Neural Network (ANN)
DOI: 10.3233/JIFS-236340
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-18, 2024
Authors: Kumar, Geethu S. | Ankayarkanni, B.
Article Type: Research Article
Abstract: Facial Emotion Recognition (FER) is a powerful tool for gaining insights into human behaviour and well-being by precisely quantifying a wide range of emotions, especially stress, through the analysis of facial images. Detecting stress using FER entails meticulously examining subtle facial cues, such as changes in eye movements, brow furrowing, lip tightening, and muscle contractions. To ensure effectiveness and real-time processing, FER approaches based on deep learning and artificial intelligence (AI) techniques were created using edge modules. This research introduces a novel approach for identifying stress that uses the Conv-XGBoost Algorithm to analyse facial emotions. The proposed model undergoes rigorous evaluation using key metrics such as the F1 score, validation accuracy, precision, and recall rate to assess its real-world reliability and robustness. This comprehensive analysis and validation proved the model’s practical utility in facial analysis. Integrating the Conv-XGBoost Algorithm with facial emotion analysis represents a promising and highly accurate solution for efficient stress detection. The method surpasses existing literature and demonstrates significant potential for practical applications based on well-validated data.
Keywords: Stress, emotion recognition, Conv-XGBoost, deep learning, facial expression
DOI: 10.3233/JIFS-237820
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Martínez Felipe, Miguel de Jesús | Martínez Castro, Jesús Alberto | Montiel Pérez, Jesús Yaljá | Chaparro Amaro, Oscar Roberto
Article Type: Research Article
Abstract: In this work, image block matching based on a dissimilarity measure is investigated. An unsupervised approach is implemented so that the algorithms have low complexity (in number of operations) compared to the full search algorithm. Existing state-of-the-art experiments use only the discrete cosine transform as a domain transform. In addition, some images were tested to evaluate the algorithms, but these images were not evaluated according to specific characteristics. In this paper, an improved version is therefore presented to tackle the problem of the dissimilarity measure in block matching in a noisy environment, using other domain transforms or low-pass filters to obtain better block-matching results; with a quantitative measure, an average accuracy margin of ±0.05 is obtained. The theoretical analysis indicates that the complexity of these algorithms is still accurate, so Hadamard spectral coefficients and Fourier filters can easily be adjusted to obtain better accuracy for the matched block group.
Keywords: Block matching, Walsh-Hadamard discrete transform, Fourier filter, dissimilarity measure, unsupervised machine learning
DOI: 10.3233/JIFS-219341
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Ensastegui-Ortega, Maria Elena | Batyrshin, Ildar | Cárdenas-Perez, Mario Fernando | Kubysheva, Nailya | Gelbukh, Alexander
Article Type: Research Article
Abstract: In today’s data-rich era, there is a growing need for effective similarity and dissimilarity measures to compare vast datasets. It is desirable that these measures reflect the intrinsic structure of the domain on which they are defined. Recently, it was shown that the space of finite probability distributions has a symmetric structure generated by an involutive negation that maps probability distributions into their “opposite” probability distributions and back, such that the correlation between opposite distributions equals –1. An important property of similarity and dissimilarity functions that reflect this symmetry is co-symmetry: the similarity between two probability distributions is equal to the similarity between their opposite distributions. This article analyzes five well-known dissimilarity functions, which are used to create new co-symmetric dissimilarity functions. To conduct this study, a random dataset of one thousand probability distributions is employed. From these distributions, dissimilarity matrices are generated and used to determine the correlations between different dissimilarity functions. Hierarchical clustering is applied to better understand the relationships between the studied dissimilarity functions. This methodology aims to identify and assess the dissimilarity functions that best match the characteristics of the studied probability distribution space, enhancing our understanding of data relationships and patterns. The study of these new measures offers a valuable perspective for analyzing and interpreting complex data, with the potential to make a significant impact in various fields and applications.
Keywords: Dissimilarity function, co-symmetry, correlation, probability distribution, negation
DOI: 10.3233/JIFS-219363
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-10, 2024
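The co-symmetry property described in the abstract above can be illustrated with a minimal sketch. Everything in it is an assumption for illustration: the abstract does not state the negation formula or how the new dissimilarity functions are built. The sketch assumes the negation takes the form N(p_i) = (max(P) + min(P) - p_i) / (n(max(P) + min(P)) - 1), uses Manhattan distance as a stand-in base dissimilarity, and makes it co-symmetric by averaging over the original and negated pair, which is one simple construction and not necessarily the authors' method. The function names are hypothetical.

```python
from math import sqrt


def negation(p):
    """Assumed involutive negation of a finite probability distribution."""
    m = max(p) + min(p)
    denom = len(p) * m - 1.0
    return [(m - x) / denom for x in p]


def pearson(x, y):
    """Pearson correlation (undefined for constant inputs such as uniform P)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def manhattan(p, q):
    """Stand-in base dissimilarity; not co-symmetric in general."""
    return sum(abs(a - b) for a, b in zip(p, q))


def cosymmetrize(d):
    """Average d over a pair and its negated pair (illustrative construction)."""
    def d_cs(p, q):
        return 0.5 * (d(p, q) + d(negation(p), negation(q)))
    return d_cs


p = [0.5, 0.3, 0.2]
q = [0.1, 0.6, 0.3]
d_cs = cosymmetrize(manhattan)

print(pearson(p, negation(p)))                  # close to -1
print(d_cs(p, q), d_cs(negation(p), negation(q)))  # equal up to rounding
```

Because the assumed negation is involutive, negating both arguments only swaps the two terms being averaged, which is why the averaged function is co-symmetric.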
Authors: Xu, Zhigang | Li, Yugen
Article Type: Research Article
Abstract: Helmet detection in construction site environments is of great significance for protecting workers’ lives and automating safety management. Because current object detection methods detect small-scale helmet objects poorly in complex construction site environments, this paper proposes a construction site helmet detection method based on multi-scale context and attention fusion. Through the proposed multi-scale context module, the method aggregates the multi-scale contextual semantics of deep image features and expands the receptive field in order to improve the network’s discriminative learning ability for small-scale helmet objects. Meanwhile, the proposed attention feature fusion module dynamically fuses shallow features with network decoding features to enhance the network’s ability to learn the global feature dependencies and local spatial detail features of helmet objects, further improving the network’s detection precision. The experimental results show that, on the constructed safety helmet wearing dataset, the proposed method achieves good detection performance and balanced detection speed compared with existing mainstream object detection methods.
Keywords: Construction site, helmet detection, CenterNet, multi-scale context, attention feature fusion
DOI: 10.3233/JIFS-236385
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Wei, Tao | Yang, Changchun | Zheng, Yanqi | Zhang, Jingxue
Article Type: Research Article
Abstract: Recently, Graph Neural Networks (GNNs) that aggregate neighborhood collaborative information have shown effectiveness in recommendation. However, GNN-based models suffer from over-smoothing and data sparsity problems. Owing to its self-supervised nature, contrastive learning has gained considerable attention in the field of recommendation as a way to alleviate highly sparse data. Graph contrastive learning models are widely used to learn the consistency of representations by constructing different graph augmentation views. Most current graph augmentations rely on random perturbation, which destroys the original graph structure information and misleads embedding learning. In this paper, an effective graph contrastive learning paradigm, CollaGCL, is proposed, which constructs graph augmentations using singular value decomposition to preserve crucial structure information. CollaGCL enables perturbed views to effectively capture global collaborative information, mitigating the negative impact of graph structural perturbations. To optimize the contrastive learning task, the extracted meta-knowledge is propagated throughout the original graph to learn reliable embedding representations. The self-information learning between views enhances the semantic information of nodes, thus alleviating the problem of over-smoothing. Experimental results on three real-world datasets demonstrate the significant improvement of CollaGCL over state-of-the-art methods.
Keywords: Self-supervised learning, recommendation, contrastive learning, data augmentation
DOI: 10.3233/JIFS-236497
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Yang, Dianqing | Wang, Wenliang
Article Type: Research Article
Abstract: Unmanned aerial vehicle (UAV) remote-sensing images have a wide range of applications in wildfire monitoring, providing invaluable data for early detection and effective management. This paper proposes an improved few-shot target detection algorithm tailored specifically for wildfire detection. The quality of UAV remote-sensing images is significantly improved by image enhancement techniques such as gamma transformation and Wiener filtering, thereby enhancing the accuracy of the detection model. Additionally, ConvNeXt-ECA, an improvement of ConvNeXt that adds the ECANet attention mechanism, is used to focus on valid information within the images. Furthermore, multi-scale feature fusion is performed by adding a feature pyramid network (FPN) to optimize the extracted small-target features. The experimental results demonstrate that the improved algorithm achieves a detection accuracy of 93.2%, surpassing Faster R-CNN by 6.6%. Moreover, the improved algorithm outperforms the target detection algorithms YOLOv8, RT-DETR, YOLOX, and SSD by 3.4%, 6.4%, 7.6%, and 21.1%, respectively. This highlights its superior recognition accuracy and robustness in wildfire detection tasks.
Keywords: Fire target detection, ConvNeXt-ECA, UAV remote-sensing image, feature pyramid network
DOI: 10.3233/JIFS-240531
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
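As a rough illustration of the gamma-based image enhancement step mentioned in the abstract above, the following sketch applies the standard power-law transform out = 255 * (in / 255) ** gamma to an 8-bit grayscale image stored as nested lists. The function name, the image representation, and the gamma value are assumptions for illustration; the abstract does not give the parameters actually used.

```python
def gamma_transform(image, gamma):
    """Apply a power-law (gamma) transform to an 8-bit grayscale image.

    image: list of rows, each a list of ints in [0, 255] (assumed layout).
    gamma < 1 brightens dark regions; gamma > 1 darkens them.
    """
    # Precompute a 256-entry lookup table so each pixel is a single index.
    lut = [round(255.0 * (v / 255.0) ** gamma) for v in range(256)]
    return [[lut[v] for v in row] for row in image]


# Hypothetical 2x3 image with mostly dark pixels.
image = [[0, 16, 64],
         [128, 200, 255]]
brightened = gamma_transform(image, 0.5)
print(brightened)
```

A lookup table is used because the transform depends only on the pixel value, so the power needs to be evaluated at most 256 times regardless of image size.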
Authors: Singh, Pratibha | Kushwaha, Alok Kumar Singh | Varshney, Neeraj
Article Type: Research Article
Abstract: Precise video moment retrieval is crucial for enabling users to locate specific moments within a large video corpus. This paper presents Interactive Moment Localization with Multimodal Fusion (IMF-MF), a novel model that leverages self-attention to achieve state-of-the-art performance. IMF-MF integrates query context and multimodal features, including visual and audio information, to accurately localize moments of interest. The model operates in two distinct phases: feature fusion and joint representation learning. The first phase dynamically calculates fusion weights for adapting the combination of multimodal video content, ensuring that the most relevant features are prioritized. The second phase employs bi-directional attention to tightly couple video and query features into a unified joint representation for moment localization. This joint representation captures long-range dependencies and complex patterns, enabling the model to distinguish between relevant and irrelevant video segments. The effectiveness of IMF-MF is demonstrated through comprehensive evaluations on three benchmark datasets: TVR (closed-world TV episodes), Charades (open-world user-generated videos), and DiDeMo (an open-world, diverse video moment retrieval dataset). The empirical results indicate that the proposed approach surpasses existing state-of-the-art methods in retrieval accuracy, as evaluated by metrics such as Recall (R1, R5, R10, and R100) and Intersection-over-Union (IoU). The results consistently highlight the benefits of its interactive moment localization approach and the use of self-attention for feature representation and attention modeling.
Keywords: Multimedia data retrieval, query-dependent fusion, ranking system, multimodal retrieval, video segment localization
DOI: 10.3233/JIFS-233071
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
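The Recall and IoU metrics named in the abstract above are commonly computed over temporal intervals. The sketch below shows one common formulation, assuming each moment is a (start, end) pair in seconds and that a query counts as a hit if any of its top-k predictions reaches an IoU threshold. The threshold of 0.5, the function names, and the sample data are assumptions for illustration; the abstract does not specify the exact evaluation protocol.

```python
def temporal_iou(pred, gt):
    """IoU of two time intervals given as (start, end)."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def recall_at_k(ranked_preds, gts, k, iou_threshold=0.5):
    """Fraction of queries with a top-k prediction at or above the IoU threshold."""
    hits = 0
    for preds, gt in zip(ranked_preds, gts):
        if any(temporal_iou(p, gt) >= iou_threshold for p in preds[:k]):
            hits += 1
    return hits / len(gts)


# Hypothetical ranked predictions for two queries, best first.
ranked_preds = [
    [(0.0, 4.0), (10.0, 20.0)],
    [(5.0, 6.0), (30.0, 40.0)],
]
gts = [(12.0, 20.0), (31.0, 39.0)]

print(recall_at_k(ranked_preds, gts, k=1))  # no top-1 prediction overlaps
print(recall_at_k(ranked_preds, gts, k=2))  # both queries are hit within top 2
```

Under this formulation R1, R5, R10, and R100 differ only in the value of k, so a single ranked list per query is enough to compute all of them.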