Deep Learning-Based Classification of Melinjo Chips: Comparative Evaluation of CNN Architectures

Deep Learning-Based Classification of Melinjo Chips: Comparative Evaluation of CNN Architectures

Indera Sakti Nasution* Zaidiyah Zaidiyah Sri Hartuti | Muhammad Fitra

Department of Agricultural Engineering, Faculty of Agriculture, Universitas Syiah Kuala, Banda Aceh 23123, Indonesia

Agricultural Mechanization and Workshop Research Center (PUSMEPTAN), Universitas Syiah Kuala, Banda Aceh 23123, Indonesia

Department of Agricultural Product Technology, Faculty of Agriculture, Universitas Syiah Kuala, Banda Aceh 23123, Indonesia

Corresponding Author Email: 
i.nasution@usk.ac.id
Page: 
633-642
|
DOI: 
https://doi.org/10.18280/mmep.130402
Received: 
7 January 2026
|
Revised: 
5 March 2026
|
Accepted: 
16 March 2026
|
Available online: 
15 May 2026
| Citation

© 2026 The authors. This article is published by IIETA and is licensed under the CC BY 4.0 license (http://creativecommons.org/licenses/by/4.0/).

OPEN ACCESS

Abstract: 

This study investigates the application of deep learning for automated classification of melinjo chips into three quality categories: super, small broken, and large broken. An initial hyperparameter optimization established a performance baseline, with the Adam optimizer (learning rate 0.0005), batch size of 32, and 100 epochs yielding 94.2% accuracy and an F1-score of 0.915. Subsequently, four pretrained Convolutional Neural Networks (CNNs)—ResNet50, ResNet18, MobileNetV2, and AlexNet—were assessed using standard metrics and confusion matrices. ResNet50 achieved the highest performance (accuracy 0.991, F1-score 0.985), demonstrating strong feature extraction capabilities. ResNet18 offered comparable results with shorter training time, while MobileNetV2 provided a favorable trade-off between computational efficiency and accuracy. AlexNet exhibited the lowest performance due to its shallow architecture. Real-time experiments indicated that higher accuracy often comes at the expense of inference speed: ResNet50 delivered superior discriminative metrics (area under the receiver operating characteristic curve (AUC) 0.961, average precision (AP) 0.962) but slower predictions, whereas MobileNetV2 enabled faster inference with slightly reduced precision. These findings emphasize the importance of balancing accuracy and efficiency for deploying deep learning models in food quality assessment systems.

Keywords: 

deep learning, Convolutional Neural Networks, melinjo chips, food quality classification, real-time inference

1. Introduction

Snack chips are a popular snack choice among consumers, and their classification can be crucial for market segmentation and product development. The Central Statistics Agency (BPS) of Indonesia noted that the exports of crackers and chips reached US 37.77 million in 2023. The value increased by 1.35% compared to the previous year [1]. As the demand for diverse snack options continues to rise, understanding the nuances of consumer preferences becomes essential for manufacturers aiming to enhance their product offerings.

Melinjo chips, known as “emping melinjo”, are a popular snack in Indonesia made from the mature seeds of the melinjo fruit (Gnetum gnemon plant). The process begins with the selection of mature melinjo seeds, which are then roasted. Roasting is a crucial step as it softens the seeds, making them easier to flatten. After roasting, the seeds are flattened using a heavy object, typically a wooden pestle, to create thin discs [2]. These discs are then dried in the sun until they become crisp, resulting in a crunchy texture that is highly sought after (Figure 1). The flattening process can lead to inconsistent texture and shape in melinjo chips due to variations in pressure applied, resulting in uneven thickness. This inconsistency can affect the cooking and crisping of the chips, potentially leading to a less desirable texture. To mitigate these issues, some producers have begun to experiment with mechanical flattening methods that promise more uniformity in thickness and texture, thereby enhancing the overall quality of the final product.

Figure 1. Melinjo fruit (left) and melinjo chips (right)

The increasing variety of flavours and textures has led to a need for effective classification methods that can accurately categorize these products based on consumer preferences and quality [3, 4]. As consumer preferences evolve, the ability to accurately classify these products becomes paramount for effective market segmentation and informed product development [5]. It is essential to develop highly accurate, reliable classification systems that can adapt to changing trends and preferences, ensuring that manufacturers stay ahead in a competitive landscape. Different methods, such as both traditional and modern methods, have been put forward to classify these products, including machine learning approaches, clustering methods, artificial intelligence, and deep learning [6]. The key differences between traditional and modern methods of product classification lie in their approach, efficiency, and adaptability to complex data. Traditional methods often rely on feature extraction and predefined rules for product classification, which can limit their adaptability to complex textual data [7], while modern methods leverage advanced technologies such as machine learning and deep learning to automate and enhance the classification process [8].

Food classification is a critical task in computer vision with applications in nutrition monitoring, food safety, and smart agriculture. Traditional machine learning approaches required manual feature engineering, which limited scalability and generalization. In contrast, deep learning architectures such as Convolutional Neural Networks (CNNs) have revolutionized food classification by enabling end-to-end learning directly from raw images [9]. Over the past decade, models like ResNet, MobileNet, and EfficientNet have been widely adopted to classify food items with high accuracy across diverse datasets.

Recent studies highlight the role of deep learning in food recognition for dietary assessment, food quality monitoring, and agricultural product classification. For example, Liu et al. [10] provided a comprehensive review of food image recognition using CNNs, while Chen et al. [11] demonstrated the effectiveness of MobileNetV2 in agricultural product classification. Qiu et al. [12] applied deep learning for automated food image classification in nutrition monitoring, and Nfor et al. [13] showed the potential of CNN-based approaches for food classification tasks. More recently, hybrid approaches combining CNNs and vision transformers have been explored for fine-grained food category classification, further improving accuracy and robustness [14].

These findings consistently demonstrate that deep learning not only surpasses classical methods but also supports real-time applications on mobile and embedded devices. As food systems become increasingly digitized, deep learning continues to provide a powerful framework for advancing intelligent food classification solutions.

Deep learning has emerged as a transformative technology in the food industry, particularly in the quality control and safety assessment of chips. These technologies enable more accurate detection of defects and anomalies, ensuring that only the highest quality products reach consumers. Deep learning models, particularly CNNs, have been effectively used to detect defects in potato chips. The SqueezeNet CNN architecture was used in a study to pull out features from pictures of potato chips. It was able to achieve high classification accuracy with Artificial Neural Networks (ANN), K Nearest Neighbor (KNN), and Random Forest (RF) [15]. Another study explored the use of ResNet50 and Inception-V3 models for defect detection in 3D food printing, which can be analogous to potato chip production, achieving accuracies of 93.83% and 84.62%, respectively [16]. Deep learning, specifically ANNs, has been applied to predict the thermo-physical properties of deep-fried plantains, which share similarities with potato chips. The ANN model accurately predicted properties such as specific heat and density, demonstrating its utility in optimizing frying processes without extensive experimental testing [17]. Deep learning's application extends beyond defect and compound detection to broader food safety and quality assessments. It has been used to classify food items as healthy or unhealthy with 90% accuracy, showcasing its potential in consumer health protection [18]. Deep learning and the Internet of Things (IoT) have been combined to find out more about the quality and safety of food. This is because deep learning is better at dealing with large datasets and complex patterns than traditional methods [19].

The exploration of deep learning models such as ResNet, MobileNet, and AlexNet has significantly advanced the field of food image recognition and classification. These models have been adapted and optimized for various applications, including food security, dietary monitoring, and agricultural productivity. ResNet50 has been integrated with vision transformers to enhance plant disease classification, demonstrating its adaptability and effectiveness in capturing both local and global image features [20]. Additionally, ResNet50 has been employed in food allergy detection, where it achieved a 95% accuracy rate, showcasing its potential in dietary management [21]. The research developed a digital image classification system to identify diseases in corn plants, achieving a high accuracy of 95.59% using the ResNet-50 architecture [22]. The ResNet18-based model has been optimized for indoor agriculture applications, such as predicting crop yields [23]. MobileNetV2 is favored for its efficiency and lightweight nature, making it suitable for applications with limited computational resources. It has been effectively used in plant disease diagnosis, achieving high accuracy and enabling faster and more accurate disease detection compared to traditional methods [24]. AlexNet has been used to classify sugarcane leaf diseases, with different activation functions like ReLU and LeakyReLU affecting the model's accuracy and computational efficiency [25]. Various CNN architectures were tested, including SqueezeNet, AlexNet, and ResNet models, to compare their classification performance [22, 26].

Even though CNNs have garnered significantly greater focus in comparison to alternative machine learning methodologies, their application in the study of snack chips, specifically melinjo, has been rarely investigated. Previous works on food classification have primarily concentrated on broader categories such as fruits, vegetables, or prepared meals, often utilizing architectures like ResNet, MobileNet, or Inception to achieve high accuracy [14, 27, 28]. However, studies targeting niche or region-specific food items remain limited, and critical challenges such as dataset variability, reproducibility, and generalization across diverse food types are still insufficiently addressed. The aim of this study is therefore to investigate the potential of CNNs in enhancing the classification of melinjo chips by applying different models and evaluating their performance.

2. Materials and Method

2.1 Dataset preparation

The dataset used in this study consists of images of melinjo chips collected from various production batches from melinjo chip producers in Pidie regency, Aceh, Indonesia, ensuring a diverse representation of quality and appearance. Three different types of melinjo chips were included: large broken, small broken, and super (Figure 2), each categorized based on their unique characteristics and processing methods. In total, 1,500 images were collected and subsequently divided into training (70%), validation (15%), and testing (15%) subsets. To prevent data leakage and ensure the reliability of the evaluation, images from a given production batch were allocated exclusively to either the training, validation, or testing set, but were not shared across multiple splits. This approach was adopted to ensure that the reported performance reflects the model’s ability to generalize beyond batch‑specific characteristics.

super

small broken

large broken

Figure 2. Classification of melinjo chips into three categories

The images were taken using a CCD camera (DBK41BU02.H, The Imaging Source Europe GmbH, Bremen, Germany) equipped with a 4.5–12.5 mm lens (Computer H3Z4512CS-IR 1/2” varifocal day/night lens). Images were taken from multiple angles and distances to reflect realistic scenarios, and with controlled light conditions. The image acquisition was performed with the IC Capture version. 2.5 TIS software (The Imaging Source Europe GmbH, Bremen, Germany). The image was captured against a black background of a conveyor belt to construct the model, which will be subsequently employed to categorize other melinjo chip samples in real-time (see subsection 2.2). Data augmentation techniques were applied to improve model generalization and reduce overfitting, including rotations and brightness variation adjustment. All experiments were conducted on a laptop running Windows 11, equipped with an Intel Core i5 CPU (12th Gen Intel, 4.40 GHz), integrated Intel Iris Xe Graphics with 8 GB system RAM.

2.2 Real-time classification setting

Real-time classification was conducted for all classifiers (4 classifiers), with each classifier classifying 300 melinjo chips at a conveyor belt speed of 2 cm/s (Figure 3). In this study, real-time image acquisition was performed using the Iriun 4K webcam application, which enabled the smartphone camera to function as a webcam connected to the PC via USB. The images were taken using the Infinix Note 30, which has a 64 MP main rear camera (f/1.69 aperture) and supports video resolution of 1440p at 30 fps and 1080p at 30/60 fps options.

Figure 3. Setting for real-time classification of melinjo chips

2.3 Convolutional Neural Network

For model training, the image dataset was pre-processed to a standardized square format of 224 × 224 pixels. Regularization was applied using a weight decay of 0.0001, and optimization employed a momentum value of 0.9. Four pretrained CNN classifiers, such as ResNet18, AlexNet, ResNet50, and MobileNetV2, were evaluated in this study. These models were customized and implemented using the MVTec Halcon software environment. The complete set of model parameters utilized in the experiments is presented in Table 1.

Table 1. Model parameters applied

Parameters

Objectives

Optimization algorithm

An optimization approach is employed during training to reduce the total loss function value.

Learning rate

The learning rate regulates the extent to which the model's weights are modified during training.

Batch size

The batch size specifies the quantity of images processed prior to an update of the model's weights.

Momentum

Momentum is used to smooth out gradient updates and accelerate convergence.

Weight decay

This regularization term mitigates excessive weight values, enhancing the model's generalization capacity and preventing overfitting.

Image size

To preserve consistency across the dataset and guarantee compatibility with deep learning frameworks.

Epochs

The number of epochs was chosen to provide adequate learning while avoiding overfitting.

2.3.1 ResNet18

ResNet18 (pretrained_dl_classifier_resnet18.hdl) is part of the ResNet family of residual networks that utilize skip connections to enhance the flow of information and gradients throughout the layers, allowing for more effective training on complex datasets. The architecture consists of 18 layers, which include convolutional layers, batch normalization, and ReLU activation functions, making it suitable for image classification tasks. The model's ability to learn hierarchical features from the input images contributes significantly to its performance, particularly in distinguishing subtle variations between classes [29, 30].

2.3.2 AlexNet

AlexNet (pretrained_dl_classifier_alexnet.hdl) classifier is a pioneering CNN that played a crucial role in popularizing deep learning for image recognition. With its eight layers, including five convolutional layers and three fully connected layers, AlexNet introduced techniques such as dropout and data augmentation to combat overfitting and improve generalization. Its architecture is designed to capture complex patterns in images, making it particularly effective for large-scale classification tasks. AlexNet is a well-established CNN architecture that has been widely used for various image classification tasks. It is particularly effective in scenarios where large datasets are available, as it can learn complex features from the data [31].

2.3.3 ResNet50

ResNet-50 (pretrained_dl_classifier_resnet50.hdl) has emerged as one of the most influential deep CNN architectures in computer vision research [32]. Its design, based on residual learning, effectively addresses the vanishing gradient problem and enables the training of very deep networks with improved accuracy. Over the past five years, ResNet-50 has been widely applied to agricultural applications [16, 20, 33, 34].

2.3.4 MobileNetV2

MobileNetV2 (pretrained_dl_classifier_mobilenetv2.hdl) is a CNN architecture that is lightweight and optimized for mobile and embedded vision applications. It is a successor to MobileNetV1 and was developed by Google researchers. It introduces linear bottlenecks and inverted residuals, which considerably reduce computational costs while maintaining high accuracy [35]. This design is particularly well-suited for real-time agricultural monitoring and smart farming systems, as it facilitates the efficient deployment of deep learning models on resource-constrained devices. MobileNetV2 has been extensively implemented in precision agriculture, crop disease detection, and product classification duties over the past five years [36-38]. It has become the preferred choice for edge-based agricultural applications due to its capacity to balance efficiency and accuracy, which are essential in sectors where energy efficiency and low latency are essential. MobileNetV2's adaptability is further emphasized in recent studies when it is combined with data augmentation and transfer learning, which further enhances its robustness in a variety of agricultural environments [11].

2.4 Fine-tuning

CNNs pretrained on large-scale datasets (specifically ResNet50, ResNet18, MobileNetV2, and AlexNet) were adapted to the task of melinjo chip classification by selectively unfreezing their final layers (Figure 4). To ensure reproducibility, the fine‑tuning procedures applied to all pretrained models are detailed as follows. Two strategies were considered: a frozen backbone, where convolutional layers were kept fixed, and only the fully connected layers were retrained, and full fine‑tuning, where all layers were optimized. Training followed a step‑decay learning rate schedule, starting at 0.001 and reduced by a factor of 0.1 every ten epochs. Early stopping was implemented with a patience of ten epochs.

Figure 4. Models' architectures of fine-tuning
Note: The output layer is modified from the original 1000 ImageNet classes to three melinjo chip categories (super, small broken, large broken).

In the case of ResNet50 and ResNet18, the last residual block together with the fully connected layers were retrained; for MobileNetV2, the final bottleneck block and classifier were adjusted; and for AlexNet, the three fully connected layers (FC6, FC7, and FC8) were optimized. This targeted fine-tuning strategy enabled the networks to refine high-level feature representations in line with the characteristics of the melinjo dataset.

2.5 Evaluation matrix

Various evaluation metrics were employed to assess the performance of the classifiers (Eqs. (1)-(6)), including accuracy, precision, recall, and F1-score [11]. These metrics provide a comprehensive understanding of how well the models perform under different conditions and help identify areas for improvement. The formula of these metrics can be expressed in Table 2.

Table 2. Evaluation matrix formula

Formula

What it Measures

Equations

Accuracy $=\frac{T P+T N}{\text { Total of data used }}$

Overall correctness of the model

(1)

Precision $=\frac{T P}{T P+F P}$

Correctness of positive predictions

(2)

Recall $=\frac{T P}{T P+F N}$

Ability to find all positive instances

(3)

$F 1-$ score $=2 \times \frac{\text { Precision } \times \text { Recall }}{\text { Precision }+ \text { Recall }}$

Harmonic mean of precision and recall

(4)

True Positive Rate $=\frac{T P}{T P+F N}$

Proportion of actual positives correctly identified

(5)

False Positive Rate $=\frac{F P}{F P+T N}$

Proportion of actual negatives incorrectly classified as positives

(6)

True Positive (TP) refers to positive samples that are correctly identified, while False Positive (FP) denotes negative samples that are mistakenly classified as positive. True Negative (TN) represents negative samples accurately recognized as such, and False Negative (FN) indicates positive samples that are incorrectly labeled as negative. These four quantities form the foundation for calculating key evaluation metrics, including accuracy, precision, recall, and the F1-score. They also support the construction of receiver operating characteristic (ROC) and precision-recall (PR) curves [39], which together provide a comprehensive view of classifier performance.

The area under the curves (AUC)-ROC is defined as the integral of the True Positive Rate with respect to the False Positive Rate, offering a measure of the classifier’s overall discriminative ability. In contrast, the AUC-PR curve emphasizes the trade-off between precision and recall, making it particularly valuable for imbalanced datasets where positive samples are relatively rare. For finding AUC, the trapezoidal rule was used [40, 41].

2.6 Hyperparameter tuning and comparative evaluation

Hyperparameter tuning was carried out in a structured manner using the MVTec Halcon software. Several configurations were explored, including the choice of optimizer (Adam and stochastic gradient descent (SGD)), different learning rates (0.001, 0.0005, 0.0001), batch sizes (16, 32, 64), and varying numbers of epochs (50, 100, 150). Each setup was evaluated on the validation set through a grid search strategy, with performance measured in terms of accuracy, F1-score, and the standard deviation across multiple runs to ensure both convergence stability and generalization. To obtain reliable estimates of variability, every experiment was repeated five times, and the mean values together with their standard deviations were reported. The configuration that delivered the most consistent results was selected as the baseline for subsequent comparisons across classifier architectures.

In addition, four pretrained classifiers provided by the HALCON deep learning framework (AlexNet, ResNet18, ResNet50, and MobileNetV2) were tested under the same experimental settings to provide a comparative evaluation. Their outputs were analyzed using confusion matrices. Real-time classification experiments were also conducted for all four models. Overall accuracy and F1-score were computed to assess predictive reliability, while AUC-ROC and AUC-PR metrics were used to evaluate discriminative ability and threshold-dependent behavior. Furthermore, the inference time per sample was measured to capture computational efficiency during real-time processing.

3. Results and Discussion

3.1 Hyperparameter and fine-tuning

The results of hyperparameter tuning establish a clear baseline for evaluating model performance before conducting comparative tests across different classifier architectures. The experiments produced accuracies ranging from 91.5% to 94.2%, with F1-scores between 0.874 and 0.915 (Table 3).

Table 3. Hyperparameter tuning

Configuration

Accuracy (%) ± Standard Deviation

F1-Score

Optimizer Adam

92.0 ± 0.9

0.875

Optimizer Stochastic Gradient Descent (SGD)

91.5 ± 1.0

0.874

Learning rate 0.001

92.2 ± 0.8

0.880

Learning rate 0.0001

92.8 ± 0.9

0.890

Learning rate 0.0005

93.5 ± 0.7

0.900

Batch size 16

92.3 ± 0.8

0.885

Batch size 32

93.8 ± 0.6

0.905

Batch size 64

92.5 ± 0.9

0.890

Epochs 50

92.1 ± 0.8

0.880

Epochs 100

94.2 ± 0.5

0.915

Epochs 150

93.7 ± 0.6

0.910

Among the tested settings, the configuration using the Adam optimizer with a learning rate of 0.0005, batch size of 32, and 100 training epochs yielded the most consistent outcomes, achieving 94.2% accuracy, an F1-score of 0.915, and the lowest variability (±0.5). This setup combined strong predictive capability with stable convergence on the importance of learning rate schedules [42] and the stability of training with moderate batch sizes [43]. In contrast, the use of SGD resulted in slightly lower accuracy and greater variance. Extreme learning rate values (0.001 or 0.0001) and smaller batch sizes also reduced stability. Likewise, shorter training runs (50 epochs) led to underfitting, while extending training to 150 epochs introduced minor overfitting without meaningful performance improvements [44].

The results of the fine‑tuning led to consistent improvements in classification accuracy across all models (Table 4). ResNet50 and MobileNetV2 achieved the most substantial gains, with increases of 5.6% and 5.7%, respectively. ResNet18 also showed a notable improvement of 4.6%, while AlexNet recorded a smaller rise of 3.2%. These accuracy gains were accompanied by minor increases in inference time, ranging from 0.02 to 0.07 seconds. Overall, the findings underscore the trade‑off introduced by fine‑tuning: accuracy is enhanced considerably, while the additional computational cost remains minimal, thereby maintaining the practicality of real‑time deployment, particularly for lightweight architectures such as MobileNetV2.

Table 4. Accuracy and inference time of the model after fine-tuning

Model

Increasing Accuracy

Increasing Inference Time

ResNet50

+5.6%

+0.07 s

ResNet18

+4.6%

+0.03 s

MobileNetV2

+5.7%

+0.02 s

AlexNet

+3.2%

+0.02 s

3.2 Classifier performance

A comparative study was undertaken on four CNN architectures (ResNet18, ResNet50, MobileNetV2, and AlexNet) to classify melinjo chips into three categories: super, small broken, and large broken (Figure 5).

Figure 5. Confusion matrix of different classifications

The evaluation relied on confusion matrix visualization and standard performance metrics. ResNet50 emerged as the most reliable model, achieving a precision of 0.986, a recall of 0.986, an F1-score of 0.985, and an overall accuracy of 0.991 (Table 5). These results are consistent with recent work that highlighted the stability and generalization capacity of ResNet50 in structural image analysis tasks [45].

MobileNetV2 followed with an accuracy of 0.980 and an F1-score of 0.970 (Table 5). Despite its lightweight design, the model demonstrated strong performance, confirming the efficiency-accuracy balance reported in studies of resource-constrained environments. The diagonal dominance observed in its confusion matrix further supports its robustness in fine-grained classification; a finding echoed in biomedical imaging applications [46].

Table 5. Performance of the classifier

Classifier

Precision

Recall

F1-Score

Accuracy ± (95% CI)

ResNet18

0.985

0.984

0.984

0.985 ± 0.008

AlexNet

0.900

0.862

0.874

0.922 ± 0.015

MobileNetV2

0.970

0.970

0.970

0.980 ± 0.010

ResNet50

0.986

0.986

0.985

0.991 ± 0.007

Note: CI = Confidence intervals, computed using bootstrapping with 100 runs.

ResNet18 achieved an accuracy of 0.985 and an F1-score of 0.984, slightly below ResNet50 but still demonstrating high precision and recall. Its residual connections help mitigate vanishing gradient issues [10], contributing to stable performance. Nevertheless, its shallower depth compared to ResNet50 may limit representational capacity in more complex feature spaces.

AlexNet, while historically important in the development of CNNs, recorded the lowest performance among the four models (accuracy 0.922, F1-score 0.874). The confusion matrix revealed higher misclassification rates, particularly between the small broken and large broken categories. This outcome aligns with recent comparative analyses that emphasize AlexNet’s limitations in fine-grained classification tasks due to its shallow architecture and absence of residual learning [47].

Training time and loss values were further examined to assess computational efficiency and convergence quality (Table 6). The results reveal a clear trade-off between model depth and training cost: deeper models achieved lower loss but required longer training durations.

Table 6. Time elapsed and loss of classifier

Classifier

Time Elapsed

Loss (95% CI)

ResNet18

13 minutes 52 seconds

0.2791 ± 0.012

AlexNet

7 minutes 3 seconds

1.0002 ± 0.045

MobileNet V2

12 minutes 52 seconds

1.0677 ± 0.038

ResNet50

4 hours 14 minutes 2 seconds

0.1017 ± 0.009

Note: CI = Confidence intervals, computed using bootstrapping with 100 runs.

ResNet50 demonstrated the most effective convergence, reaching the lowest loss value of 0.1017, though training extended to 4 hours 14 minutes. Its deep residual architecture enables superior feature extraction and generalization, but the high parameter count demands more iterations to stabilize gradients [48]. ResNet18, with fewer layers, completed training in 13 minutes 52 seconds and produced a moderate loss of 0.2791. While faster, its reduced depth limited its ability to capture complex patterns [49]. MobileNetV2, designed for efficiency, trained in 12 minutes 52 seconds but yielded the highest loss (1.0677), reflecting its emphasis on speed over precision [50]. AlexNet trained the fastest (7 minutes 3 seconds) but produced a relatively high loss, underscoring the constraints of its simple architecture in multi-class classification scenarios [51].

3.3 Real-time classification performance

A comparative analysis was carried out on four CNN architectures (ResNet50, ResNet18, AlexNet, and MobileNetV2) to evaluate their performance in classifying melinjo chips under real-time conditions (Figure 6). Among these, ResNet50 achieved the strongest results, with an AUC of 0.961 and an average precision (AP) of 0.962. These values highlight its ability to separate classes effectively and maintain precision across varying recall thresholds. Similar observations have been reported in agricultural studies, where ResNet50 consistently outperforms shallower models in tasks due to its deep residual connections and robust feature extraction capacity [52-54].

Figure 6. Area under the curve (AUC) of receiver operating characteristic (ROC) and precision-recall (PR) curves

ResNet18 followed closely with an AUC of 0.926 and an AP of 0.929, confirming its reliability despite having a shallower depth. Recent work on maize seed classification demonstrated that ResNet18 can achieve strong performance when image resolution and depth are carefully optimized [55]. Its balance between computational efficiency and accuracy makes it a practical choice for agricultural applications where processing speed is critical.

AlexNet, although historically important in the development of deep learning, produced moderate outcomes (AUC 0.877, AP 0.902). The absence of residual connections and limited depth restricts its ability to capture subtle differences in agricultural commodities. Recent studies in fruit grading have shown that AlexNet’s performance tends to plateau compared to more advanced CNNs, particularly when precision–recall trade-offs are emphasized [56, 57].  

MobileNetV2 recorded the lowest curve metrics (AUC 0.811, AP 0.845), reflecting its lightweight design optimized for speed and resource efficiency. While this architecture is advantageous for embedded agricultural systems, its reduced representational capacity limits precision in complex classification tasks. Similar limitations have been observed in crop disease detection, where MobileNetV2 achieved faster inference times but lower accuracy than deeper CNNs [58].

The efficiency of deep learning models in agricultural applications depends not only on predictive accuracy but also on their inference time, which determines the feasibility of real-time deployment. In the case of melinjo classification, inference time reflects how quickly a trained network processes new images and delivers reliable predictions (Figure 7).

Large broken detected

Small broken detected

Figure 7. Example of real-time classification of a melinjo chips sample using MVTec Halcon software

This factor is critical for practical agricultural systems such as automated grading, sorting, and quality inspection, where throughput and latency directly influence productivity. Our results highlighted that ResNet50, although offering high classification fidelity, required longer inference times, making it less suitable for edge-based systems (Table 7). ResNet50 demonstrated superior accuracy largely because of its deeper architecture and the larger number of parameters it employs. This design enables the network to learn complex and highly discriminative features from the dataset, allowing for more precise classification [59]. At the same time, the increased depth comes with a computational burden, which explains the longer inference times observed. In contrast, MobileNetV2 achieved the fastest inference speed due to its lightweight structure. The model relies on depthwise separable convolutions and an efficient bottleneck mechanism, both of which substantially reduce the number of operations required during prediction. These architectural innovations make MobileNetV2 particularly suitable for embedded agricultural applications where rapid decision‑making is critical [60]. The results highlight a clear trade‑off between accuracy and speed, reflecting the fundamental design differences of the two models. Selecting the appropriate architecture, therefore, depends on the operational requirements of the agricultural system in question. This interpretation aligns with recent studies showing that deeper residual networks, such as ResNet50, consistently outperform earlier CNNs in terms of accuracy because of their ability to capture complex hierarchical features [61]. Conversely, MobileNetV2’s streamlined design, based on depthwise separable convolutions and bottleneck layers, has been shown to lower computational cost and inference time, making it well‑suited for real‑time deployment on resource‑constrained platforms [62].

Table 7. Inference time real-time classification

Classifiers

Inference Time (ms)

95% CI (ms)

ResNet50

50.22

±2.10

ResNet18

32.18

±1.75

AlexNet

28.17

±1.60

MobileNet V2

18.15

±1.20

Note: CI = Confidence intervals, computed using bootstrapping with 100 runs.

Recent advances in agricultural research highlight the growing importance of lightweight deep learning architectures for real-time deployment. Models optimized for inference speed can significantly enhance decision-making in precision crop protection, allowing timely interventions in the field [63]. In a similar vein, another study emphasized that striking a balance between computational efficiency and predictive accuracy is essential for sustainable automation in agriculture, particularly when operating under resource constraints [64]. Another study also introduced LiSA-MobileNetV2, an extremely lightweight model for rice disease classification, showing that inference efficiency directly impacts scalability when handling large datasets [65]. Further study validated the role of MobileNetV2 in intelligent agriculture, reporting that lightweight architectures can maintain accuracy while reducing computational costs [66]. More recently, modified MobileNetV2 networks have made it possible to find tomato diseases in real time. This shows that inference time is just as important as accuracy when choosing models for agricultural use [12].

4. Conclusion

The study confirms that CNN-based approaches can effectively classify melinjo chips, with ResNet50 emerging as the most accurate and reliable model. Its deep residual connections enable robust feature extraction and generalization, though this comes with higher computational cost and longer inference times. ResNet18 provides a practical compromise, offering strong accuracy with reduced training duration. MobileNetV2, designed for efficiency, is particularly well-suited for real-time and embedded agricultural applications where rapid decision-making is critical. AlexNet, while foundational in the history of deep learning, demonstrated limited capability in fine-grained agricultural classification tasks.

It is important to recognize certain limitations of this study. The dataset employed was relatively modest in size and collected within a controlled acquisition setting, which may not fully reflect the variability encountered under real agricultural conditions. In addition, potential class imbalance could have influenced the training process and model performance, particularly in minority categories. These constraints indicate that, while the findings are promising, they should be interpreted with caution when considering broader applications.

Overall, the results emphasize that model selection in agricultural contexts should balance predictive accuracy, generalization ability, and inference speed. Deeper architectures such as ResNet50 are preferable for high-fidelity classification, while lightweight models like MobileNetV2 are advantageous for real-time deployment in resource-constrained environments. These insights contribute to the development of scalable and efficient deep learning solutions for commodity grading and quality inspection in precision agriculture.

Acknowledgment

The authors gratefully acknowledge the financial support of the Directorate General of Higher Education, Ministry of Education and Culture of the Republic of Indonesia in the year of 2024. The image processing software supported by the MVTec Halcon is also gratefully acknowledged.

  References

[1] Central Bureau of Statistics of Indonesia. (2024). Indonesia Foreign Trade Statistics Exports 2023. https://www.bps.go.id/en/publication/2024/07/05/aacd81374f3ae5d241c14598/indonesia-foreign-trade-statistics-exports-2023--book-i.html.

[2] Dhafir, M., Zulfahrizal, Z., Fadhil, R., Safrizal, S, Setiawati, S. (2023). Rancang bangun alat pemipih emping Melinjo (Gnetum gnemon) tipe mekanis. Rona Teknik Pertanian, 14(1): 38-50. https://doi.org/10.17969/rtp.v14i1.19651

[3] Fagundes Grilo, M., Nunes, B., Duran, A.C., Zancheta Ricardo, C., et al. (2024). Applying the Nova food classification to food product databases using discriminative ingredients: A methodological proposal. Frontiers in Public Health, 12: 24305721. https://doi.org/10.1101/2024.04.12.24305721

[4] Sudharson, S., Kokil, P., Annamalai, R., Manoj, N.S. (2023). Enhanced food classification system using YOLO models for object detection algorithm. In 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT), Delhi, India, pp. 1-6. https://doi.org/10.1109/ICCCNT56998.2023.10307943 

[5] Sun, F., Luh, D.B., Zhao, Y., Sun, Y. (2022). Product classification with the motivation of target consumers by deep learning. IEEE Access, 10: 62258-62267. https://doi.org/10.1109/ACCESS.2022.3181624

[6] Xu, W., Zhu, Z., Wang, L. (2022). Comparative analysis of different machine learning algorithms in classification. In 2022 International Conference on Big Data, Information and Computer Network (BDICN), Sanya, China, pp. 257-263. https://doi.org/10.1109/BDICN55575.2022.00057 

[7] İhsanoğlu, A., Zaval, M., Yıldız, O.T. (2024). Comparison of machine learning algorithms and large language models for product categorization. In 2024 32nd Signal Processing and Communications Applications Conference (SIU), Mersin, Türkiye, pp. 1-4. https://doi.org/10.1109/SIU61531.2024.10600809

[8] Jajuga, K., Dehnel, G., Walesiak, M. (Eds.). (2022). Modern Classification and Data Analysis: Methodology and Applications to Micro-and Macroeconomic Problems. Springer International Publishing. https://doi.org/10.1007/978-3-031-10190-8

[9] Yusufoğlu, B., Özdemir, S., Zargarchi, S., Gündoğan, R., Capanoglu, E., Tekinerdoğan, B., Esatbeyoglu, T. (2026). Machine and deep learning for food integrity: A review of emerging analytical applications in safety, quality, and authenticity. Applied Food Research, 6(1): 102100. https://doi.org/10.1016/J.AFRES.2026.102100

[10] Liu, H., Brailsford, T., Bull, L. (2025). ResNet18 performance: Impact of network depth and image resolution on image classification. In ICAAI '24: Proceedings of the 2024 8th International Conference on Advances in Artificial Intelligence, London, United Kingdom, pp. 351-356. https://doi.org/10.1145/3704137.3704173

[11] Chen, H., Zhou, G., He, W., Duan, X., Jiang, H. (2024). Classification and identification of agricultural products based on improved MobileNetV2. Scientific Reports, 14: 3454. https://doi.org/10.1038/s41598-024-53349-w

[12] Qiu, H., Yang, J., Jiang, J., Zhang, W. (2024). Mob-psp: Modified MobileNet-V2 network for real-time detection of tomato diseases. Journal of Real-Time Image Processing, 21: 181. https://doi.org/10.1007/s11554-024-01561-2

[13] Nfor, K.A., Theodore Armand, T.P., Ismaylovna, K.P., Joo, M.I., Kim, H.C. (2025). An explainable CNN and vision transformer-based approach for real-time food recognition. Nutrients, 17(2): 362. https://doi.org/10.3390/nu17020362

[14] Chen, C.S., Chen, G.Y., Zhou, D., Jiang, D., Chen, D., Chang, S.H. (2025). Improving fine-grained food classification using deep residual learning and selective state space models. PLOS ONE, 20(5): e0322695. https://doi.org/10.1371/journal.pone.0322695

[15] Taspinar, G., Koklu, M. (2024). Detecting industrial potato chips defects with machine learning methods using deep features. Intelligent Methods in Engineering Sciences, 3(2): 63-69. https://doi.org/10.58190/IMIENS.2024.100

[16] Mawardi, C., Buono, A., Priandana, K. (2024). Performance analysis of ResNet50 and Inception-V3 image classification for defect detection in 3D food printing. International Journal on Advanced Science, Engineering & Information Technology, 14(2): 798-804. https://doi.org/10.18517/IJASEIT.14.2.19863 

[17] Adeyanju, J.A., Olajide, J.O., Oke, E.O., Hussein, J.B., Ude, C.J. (2021). Prediction of the thermo-physical properties of deep-fat frying plantain chips (ipekere) using artificial neural network. Acta Universitatis Cinbinesis, Series E: Food Technology, 25(2): 253-260. https://doi.org/10.2478/AUCFT-2021-0023 

[18] Natesan, P., Selvan, T., Shrivarshini, M., Dhanya, B., Kalaiselvi, S. (2023). Prediction of healthy and unhealthy food items using deep learning. In 2023 7th International Conference on Computing Methodologies and Communication (ICCMC), Erode, India, pp. 233-239. https://doi.org/10.1109/ICCMC56507.2023.10084041

[19] Shukla, A., Mandal, D., Meena, R.S., Vijayarajan, V. (2022). Artificial intelligence and deep learning-based agri and food quality and safety detection system. In International Conference on Intelligent Computing and Communication, Springer, Singapore, pp. 81-91. https://doi.org/10.1007/978-981-99-1588-0_8

[20] Kumaar, D.M., Palani, S. (2024). ResNet50 integrated vision transformer for enhanced plant disease classification. In 2024 3rd International Conference on Artificial Intelligence for Internet of Things (AIIoT), Vellore, India, pp. 1-6. https://doi.org/10.1109/AIIOT58432.2024.10574771

[21] Senapati, B., Talburt, J.R., Naeem, A.B., Batthula, V.J.R. (2023). Transfer learning based models for food detection using ResNet-50. In 2023 IEEE International Conference on Electro Information Technology (eIT), Romeoville, IL, USA, pp. 224-229. https://doi.org/10.1109/EIT57321.2023.10187288

[22] Rachmad, A. (2023). Convolutional neural network-based classification model of corn leaf disease. Mathematical Modelling of Engineering Problems, 10(2): 530-536. https://doi.org/10.18280/mmep.100220

[23] Radha, D., Prasanna, S. (2024). A unique ADAGRAD optimized DCNN with RESNET-18 architecture for indoor agriculture-based crop yield. In 2024 IEEE International Conference on Computing, Power and Communication Technologies (IC2PCT), Greater Noida, India, pp. 767-771. https://doi.org/10.1109/IC2PCT60090.2024.10486749

[24] Prabhu, D., Dilip, G. (2024). Chilly leaves diseases identification using MobileNet V2 deep learning model. Journal of Electrical Systems, 20(3s): 150-157. https://doi.org/10.52783/JES.1263

[25] Kursun, R., Yasin, E.T., Koklu, M. (2024). Classification of sugarcane leaf disease with AlexNet model. Proceedings of International Conference on Intelligent Systems and New Applications, 2: 32-37. https://doi.org/10.58190/icisna.2024.86

[26] Binnar, V., Sharma, S. (2023). Plant leaf diseases detection using deep learning algorithms. In Machine Learning, Image Processing, Network Security and Data Sciences, pp. 217-228. https://doi.org/10.1007/978-981-19-5868-7_17

[27] Rahin, M., Devi, A., Gowtham, M., Sivaraman, C. (2025). A novel deep learning framework for food ingredient classification using ResNet and SMO classifiers. In 2025 International Conference on Computational Innovations and Engineering Sustainability (ICCIES), Coimbatore, India, pp. 1-6. https://doi.org/10.1109/ICCIES63851.2025.11032532

[28] Ciocca, G., Napoletano, P., Schettini, R. (2016). Food recognition: A new dataset, experiments, and results. IEEE Journal of Biomedical and Health Informatics, 21(3): 588-598. https://doi.org/10.1109/JBHI.2016.2636441

[29] Wang, M., Gong, X. (2020). Metastatic cancer image binary classification based on ResNet model. In 2020 IEEE 20th International Conference on Communication Technology (ICCT), Nanning, China, pp. 1356-1359. https://doi.org/10.1109/ICCT50939.2020.9295797 

[30] Kobayashi, G., Shouno, H. (2021). Interpretation of ResNet by visualization of the preferred stimulus in receptive fields. In Advances in Parallel & Distributed Processing, and Applications: Proceedings from PDPTA'20, CSC'20, MSV'20, and GCC'20, pp. 769-779. https://doi.org/10.1007/978-3-030-69984-0_56

[31] Singh, I., Goyal, G., Chandel, A. (2022). AlexNet architecture based convolutional neural network for toxic comments classification. Journal of King Saud University - Computer and Information Sciences, 34(9): 7547-7558. https://doi.org/10.1016/J.JKSUCI.2022.06.007

[32] He, K., Zhang, X., Ren, S., Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, pp. 770-778. https://doi.org/10.1109/CVPR.2016.90

[33] Xue, X., Thakur, V., Dhumras, H., Jhaveri, R.H., Gadekallu, T.R. (2025). Real-time pest detection using ResNet-50 and vision transformer: An IoT-enabled mobile application for smart agriculture. IEEE Transactions on Consumer Electronics, 71: 8989-8997. https://doi.org/10.1109/TCE.2025.3599533

[34] Wu, W., Huo, L., Yang, G., Liu, X., Li, H. (2025). Research into the application of ResNet in soil: A review. Agriculture, 15(6): 661. https://doi.org/10.3390/agriculture15060661

[35] Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.C. (2018). Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, pp. 4510-4520. https://doi.org/10.1109/CVPR.2018.00474

[36] Njoroge, T.K., Omol, E.J., Nyangaresi, V.O. (2025). Deep learning and IoT fusion for crop health monitoring: A high-accuracy, edge-optimised model for smart farming. IET Image Processing, 19(1): e70208. https://doi.org/10.1049/ipr2.70208

[37] Jesupriya, J., Mageswari, P.U., Alli, A. (2025). Deep learning-based transfer learning with MobileNetV2 for crop disease detection. In 2025 International Conference on Intelligent and Innovative Technologies in Computing, Electrical and Electronics (IITCEE), Bangalore, India, pp. 1-9. https://doi.org/10.1109/IITCEE64140.2025.10915399

[38] Moses, K., Miglani, A., Kankar, P.K. (2022). Deep CNN-based damage classification of milled rice grains using a high-magnification image dataset. Computers and Electronics in Agriculture, 195: 106811. https://doi.org/10.1016/j.compag.2022.106811

[39] Fawcett, T. (2006). An introduction to ROC analysis. Pattern Recognition Letters, 27(8): 861-874. https://doi.org/10.1016/j.patrec.2005.10.010

[40] Safder, I., Batool, H., Sarwar, R., Zaman, F., et al. (2022). Parsing AUC result-figures in machine learning specific scholarly documents for semantically-enriched summarization. Applied Artificial Intelligence, 36(1): 2004347. https://doi.org/10.1080/08839514.2021.2004347

[41] Pavičić, J., Andreić, Ž., Malvić, T., Rajić, R., Velić, J. (2018). Application of Simpsons and trapezoidal formulas for volume calculation of subsurface structures-recommendations. In 2nd Croatian Congress on Geomathematics and Geological Terminology, UK, pp. 15-26.

[42] Smith, L.N. (2017). Cyclical learning rates for training neural networks. In 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA, pp. 464-472. https://doi.org/10.1109/WACV.2017.58

[43] Keskar, N.S., Mudigere, D., Nocedal, J., Smelyanskiy, M., Tang, P.T.P. (2016). On large-batch training for deep learning: Generalization gap and sharp minima. arXiv preprint arXiv:1609.04836. https://doi.org/10.48550/arXiv.1609.04836

[44] Bengio, Y., Alain, G., Rifai, S. (2012). Implicit density estimation by local moment matching to sample from auto-encoders. arXiv preprint arXiv:1207.0057. https://doi.org/10.48550/arXiv.1207.0057

[45] Bussa, S.K., Boppana, N.K. (2025). Enhanced ResNet50 deep learning algorithm for classification of crack images in RCC structures. Asian Journal of Civil Engineering, 26(9): 3773-3784. https://doi.org/10.1007/s42107-025-01396-7

[46] Rahman, M.A., Miah, M.B.A., Hossain, M.A., Hosen, A.S. (2025). Enhanced brain tumor classification using MobileNetV2: A comprehensive preprocessing and fine-tuning approach. BioMedInformatics, 5(2): 30. https://doi.org/10.3390/biomedinformatics5020030

[47] Dioses, I.A.M., Castillo, W., Santiago, P.N., Geronimo, M.P., Macaso, J.A., Domingo, S.D. (2025). Performance of ResNet50 in classifying artificial intelligence generated arts. In 2025 21st IEEE International Colloquium on Signal Processing & Its Applications (CSPA), Pulau Pinang, Malaysia, pp. 361-366. https://doi.org/10.1109/CSPA64953.2025.10933073

[48] Zhao, C.H., Leong, W.Y. (2025). Optimization solutions and simple innovative solution research on ResNet50 model. ASM Science Journal, 20(1): 2002. https://doi.org/10.32802/asmscj.2025.2002

[49] Liu, X., Goh, K.M. (2025). ResNet: Enabling deep convolutional neural networks through residual learning. arXiv preprint arXiv:2510.24036. https://doi.org/10.48550/arXiv.2510.24036

[50] Askale, G.T., Yibel, A.B., Munie, A.T., Areru, S.B., Debasu, M.A. (2026). A customized MobileNetV2-based lightweight CNN for monkeypox detection and classification. Scientific Reports, 16: 5028. https://doi.org/10.1038/s41598-026-35871-1

[51] Kuldashboy, A., Umirzakova, S., Allaberdiev, S., Nasimov, R., Abdusalomov, A., Im Cho, Y. (2024). Efficient image classification through collaborative knowledge distillation: A novel AlexNet modification approach. Heliyon, 10(14): e34376. https://doi.org/10.1016/j.heliyon.2024.e34376

[52] Ritharson, P.I., Raimond, K., Mary, X.A., Robert, J.E. (2024). DeepRice: A deep learning and deep feature based classification of Rice leaf disease subtypes. Artificial Intelligence in Agriculture, 11: 34-49. https://doi.org/10.1016/j.aiia.2023.11.001

[53] Kumar, M., Arora, A., Deb, A., Yadav, A.L. (2024). Deep learning for accurate plant disease classification using ResNet50: A comprehensive approach. In 2024 International Conference on Computational Intelligence and Computing Applications (ICCICA), Samalkha, India, 125-130. https://doi.org/10.1109/ICCICA60014.2024.10584860

[54] Thamarai Selvi, S.B., Thirumurugan, S., Kanish, P., Surya, M. (2024). Deep learning based paddy disease classification using ResNet-50. In Proceedings of the 1st International Conference on Artificial Intelligence, Communication, IoT, Data Engineering and Security (IACIDS 2023), Lavasa, Pune, India. https://doi.org/10.4108/eai.23-11-2023.2343203

[55] Akbarpour, O., Akbari, S., Fanourakis, D., Taheri-Garavand, A. (2026). Deep learning-based seed variety classification: A case study in maize. BMC Plant Biology, 26(1): 105. https://doi.org/10.1186/s12870-025-07919-3 

[56] Abdulghani, B.A., Al-Sulaifanie, A.I., Mohammed, A.K. (2025). A robust deep learning framework for multi-class fruit and vegetable classification using optimized AlexNet. Traitement du Signal, 42(6): 3345-3355. https://doi.org/10.18280/ts.420623

[57] Suma, K.G., Sunitha, G., Sreeram, G., Meghana, K., Deeraj, D., Supriya, K., Srinivasulu, A. (2024). AlexNet-based deep learning model for fruit image classification. In International Conference on Computer & Communication Technologies, India, pp. 51-59. https://doi.org/10.1007/978-981-96-5238-9_5 

[58] Singh, G., Singh, R.S., Mehta, R., Nayak, V.K. (2025). AgriNet: Leveraging MobileNetV2 with explainable AI and drone imaging for efficient plant disease diagnosis. In International Conference on Mathematical Modeling and Computational Science, pp. 254-262. https://doi.org/10.1007/978-3-031-91005-0_24 

[59] AG, B., Srinivasan, S., P, M., Mathivanan, S.K., Shah, M.A. (2024). Robust brain tumor classification by fusion of deep learning and channel-wise attention mode approach. BMC Medical Imaging, 24(1): 147. https://doi.org/10.1186/s12880-024-01323-3 

[60] Padmaja, D.L., Nikhil, B., Akshaya, B.S., Surya Deepak, G. (2023). A comparative analysis of ResNet and MobileNet for classifying MRI images. In International Conference on Data Science, Machine Learning and Applications, India, pp. 22-30. https://doi.org/10.1007/978-981-97-8031-0_3

[61] Talaat, F.M., El-Sappagh, S., Alnowaiser, K., Hassan, E. (2024). Improved prostate cancer diagnosis using a modified ResNet50-based deep learning architecture. BMC Medical Informatics and Decision Making, 24(1): 23. https://doi.org/10.1186/s12911-024-02419-0

[62] Aboh, M. (2025). Benchmarking EfficientNet, MobileNet and ResNet architectures for cassava disease detection in agricultural robotics. Journal of Engineering Research and Reports, 27(10): 380-387. https://doi.org/10.9734/jerr/2025/v27i101680

[63] Xu, Y., Li, D., Li, C., Yuan, Z., Dai, Z. (2025). LiSA-MobileNetV2: An extremely lightweight deep learning model with Swish activation and attention mechanism for accurate rice disease classification. Frontiers in Plant Science, 16: 1619365. https://doi.org/10.3389/fpls.2025.1619365 

[64] Albahli, S. (2025). AgriFusionNet: A lightweight deep learning model for multisource plant disease diagnosis. Agriculture, 15(14): 1523. https://doi.org/10.3390/agriculture15141523

[65] Yu, P., Teng, F., Zhu, W., Shen, C., Chen, Z., Song, J. (2025). Cloud–edge–device collaborative computing in smart agriculture: Architectures, applications, and future perspectives. Frontiers in Plant Science, 16: 1668545. https://doi.org/10.3389/fpls.2025.1668545 

[66] Lu, J., Liu, X., Ma, X., Tong, J., Peng, J. (2023). Improved MobileNetV2 crop disease identification model for intelligent agriculture. PeerJ Computer Science, 9: e1595. https://doi.org/10.7717/peerj-cs.1595