Most Read Articles

Published in last 1 year |  In last 2 years |  In last 3 years |  All
Please wait a minute...
Wolfberry pest detection based on improved YOLOv5
Dingjian DU,Zunhai GAO,Zhuo CHEN
Journal of ZheJiang University (Engineering Science)    2024, 58 (10): 1992-2000.   DOI: 10.3785/j.issn.1008-973X.2024.10.002
Abstract   HTML PDF (3603KB) ( 648 )  

A model based on improved YOLOv5m was proposed for wolfberry pest detection in a complex environment. The next generation vision transformer (Next-ViT) was used as the backbone network to improve the feature extraction ability of the model, and the key target features were given more attention by the model. An adaptive fusion context enhancement module was added to the neck to enhance the model’s ability to understand and process contextual information, and the precision of the model for the small object (aphids) detection was improved. The C3 module in the neck network was replaced by using the C3_Faster module to reduce the model footprint and further improve the model precision. Experimental results showed that the proposed model achieved a precision of 97.0% and a recall of 92.1%. The mean average precision (mAP50) was 94.7%, which was 1.9 percentage points higher than that of the YOLOv5m, and the average precision of aphid detection was improved by 9.4 percentage points. The mAP50 of different models were compared and the proposed was 1.6, 1.6, 2.8, 3.5, and 1.0 percentage points higher than the mainstream models YOLOv7, YOLOX, DETR, EfficientDet-D1, and Cascade R-CNN, respectively. The proposed model improves the detection performance while maintaining a reasonable model footprint.

Table and Figures | Reference | Related Articles | Metrics
Path planning of agricultural robots based on improved deep reinforcement learning algorithm
Wei ZHAO,Wanzhi ZHANG,Jialin HOU,Rui HOU,Yuhua LI,Lejun ZHAO,Jin Cheng
Journal of ZheJiang University (Engineering Science)    2025, 59 (7): 1492-1503.   DOI: 10.3785/j.issn.1008-973X.2025.07.017
Abstract   HTML PDF (2200KB) ( 234 )  

In order to solve the problems of difficulty in finding target points, sparse rewards, and slow convergence when using deep reinforcement learning algorithms for path planning of agricultural robots, a path-planning method based on multi-target point navigation integrated improved deep Q-network algorithm (MPN-DQN) was proposed. The laser simultaneous localization and mapping (SLAM) was used to scan the global environment to construct a prior map and divide the walking row and crop row areas, and the map boundary was expanded and fitted to form a forward bow-shaped operation corridor. The middle target point was used to segment the global environment, and the complex environment was divided into a multi-stage short-range navigation environment to simplify the target point search process. The deep Q-network algorithm was improved from three aspects: action space, exploration strategy and reward function to improve the reward sparsity problem, accelerate the convergence speed of the algorithm, and improve the navigation success rate. Experimental results showed that the total number of collisions of agricultural robots equipped with the MPN-DQN algorithm was 1, the average navigation time was 104.27 s, the average navigation distance was 16.58 m, and the average navigation success rate was 95%.

Table and Figures | Reference | Related Articles | Metrics
Area coverage path planning for tilt-rotor unmanned aerial vehicle based on enhanced genetic algorithm
Yue’an WU,Changping DU,Rui YANG,Jiahao YU,Tianrui FANG,Yao ZHENG
Journal of ZheJiang University (Engineering Science)    2024, 58 (10): 2031-2039.   DOI: 10.3785/j.issn.1008-973X.2024.10.006
Abstract   HTML PDF (2211KB) ( 830 )  

An enhanced genetic algorithm was proposed to address the challenge of area coverage path planning for a tilt-rotor unmanned aerial vehicle (TRUAV) amidst multiple obstacles. A preliminary coverage path plan for the designated task area was devised, utilizing the minimum spanning and back-and-forth path generation algorithms. The area coverage dilemma was transformed into a traveling salesman problem to optimize the sequence of the coverage path. A fishtail-shaped obstacle avoidance strategy was proposed to circumvent obstacles within the region. The nearest neighbor algorithm was introduced to generate a superior initial population than a genetic algorithm. A three-point crossover operator and a dynamic interval mutation operator were adopted in the genetic processes to improve the proposed algorithm's global search capacity and prevent the algorithm from falling into local optima. The efficacy of the proposed algorithm was rigorously tested through simulations in polygonal areas with multiple obstacles. Results showed that, compared to the sequential path coverage algorithm and the genetic algorithm, the proposed algorithm reduced the length of the coverage path by 7.80%, significantly enhancing the coverage efficiency of TRUAV in the given task areas.

Table and Figures | Reference | Related Articles | Metrics
Structure design and motion analysis of bionic hexapod origami robot
Dongxing CAO,Yanchao JIA,Xiangying GUO,Jiajia MAO
Journal of ZheJiang University (Engineering Science)    2024, 58 (8): 1543-1555.   DOI: 10.3785/j.issn.1008-973X.2024.08.002
Abstract   HTML PDF (3603KB) ( 374 )  

A new design scheme of crab-like hexapod origami robot was proposed by combining the origami structure with the multi-legged robot design and coupling Miura origami and six-fold origami aiming at the problems that the existing origami robots have a single structure and insufficient flexibility in movement. The motion configuration of the origami robot was expanded, and the motion flexibility of the origami robot was improved. Each leg of the robot has two degrees of freedom under the symmetry hypothesis. The vertices of the robot legs were treated as joints, and the crease lines were regarded as links. A planar link equivalent model of the robot legs was established with the folding angle as the motion variable. The theoretical range of motion for the robot’s foot was determined through simulation calculations. Then tapered panel technique was utilized to thicken the folding surfaces and prevent physical interference between adjacent folding surfaces. A three-dimensional model of the origami crab-like hexapod robot was constructed. The relationship between the folding angle and foot motion was analyzed based on the equivalent model of planar links, and the foot motion trajectory and gait of the robot were designed. The experimental prototype of origami bionic hexapod robot was designed and manufactured by using 3D printing technology, and the lateral movement of the robot was realized based on STM32 microcontroller control. Results show that the origami bio-inspired robot can realize the conversion from plane configuration to a crab-like configuration. The robot can move smoothly left and right under the coordinated movement of six legs.

Table and Figures | Reference | Related Articles | Metrics
Improved YOLOv7 based apple target detection in complex environment
Henghui MO,linjing WEI
Journal of ZheJiang University (Engineering Science)    2024, 58 (12): 2447-2458.   DOI: 10.3785/j.issn.1008-973X.2024.12.004
Abstract   HTML PDF (1927KB) ( 439 )  

Robotic harvesters face challenges in identifying apples under complex natural conditions such as unstable lighting, high fruit diversity, and severe leaf occlusion, which impedes the capture of key features, reducing harvesting efficiency and accuracy. An enhanced apple detection algorithm based on the YOLOv7 model for complex scenarios was proposed. A limited contrast adaptive histogram equalization technique was employed to enhance the contrast of apple images, reducing the background interference and clarifying the target contours. A multi-scale hybrid adaptive attention mechanism was introduced. The features were decomposed and reconstructed, and the spatial and channel attention directives were synergistically integrated to optimize multi-layer feature modeling over various distances, thereby boosting the model’s capability to extract apple features and resist background noise. Full-dimensional dynamic convolution was implemented to refine the feature selection process through a meticulous attention mechanism. The number of detection heads was increased to address the challenges of detecting small targets. The Meta-ACON activation function was used to optimize the attention allocation during feature extraction process. Experimental results demonstrated that the improved YOLOv7 model, achieved average accuracy and recall rates of 85.7% and 87.0%, respectively. Compared to Faster R-CNN, SSD, YOLOv5, and the original YOLOv7, the average detection precision was improved by 15.2, 7.5, 4.5, and 2.5 percentage points, and the average recall was improved by 13.7, 6.5, 3.6, and 1.3 percentage points, respectively. The model exhibits exceptional performance, providing robust technical support for apple growth monitoring and mechanical harvesting research.

Table and Figures | Reference | Related Articles | Metrics
Evaluation of generator side inertia based on electromechanical oscillation of power system
Zhiqiang REN,Mingxing TIAN,Yu JIANG,Dongfeng XING
Journal of ZheJiang University (Engineering Science)    2025, 59 (4): 870-878.   DOI: 10.3785/j.issn.1008-973X.2025.04.023
Abstract   HTML PDF (1102KB) ( 292 )  

The connection of new energy power generation equipment to the power generation side leads to the emergence of “weak inertia” characteristics on the power generation side, which affects the safe and stable operation of the system. The synchronous phase measurement unit (PMU) was used to measure the electromechanical oscillation response, and based on the electromechanical oscillation parameter under small perturbation, an inertia assessment method for the power generation side was proposed. Based on the characteristics of the inertia response process, the unbalanced power allocation equation related to the inertia of each generator was derived. Based on the relationship between the small-signal state equation and the characteristic root of the multi-machine system, the formula for calculating the inertia of the generation side of a multi-machine system was derived. The inertia calculation of the generation side of a single-machine system was introduced, and the measurement methods of inertia ratio and the intrinsic oscillation frequency in the inertia calculation formula were described. The correctness of the proposed method was verified by simulation examples of a single-machine system, a dual-machine interconnection system, a WSCC 3-machine 9-node system, and a 10-machine 39-node system. Results show that the generation side inertia evaluation values obtained with the proposed method in several systems are close to the actual values and have good adaptability. The method can be used for power system generation side inertia evaluation.

Table and Figures | Reference | Related Articles | Metrics
Empty-load charging strategy for autonomous vehicle parking based on multi-agent system
Wenhao LI,Yanjie JI,Hao WU,Yewen JIA,Shuichao ZHANG
Journal of ZheJiang University (Engineering Science)    2024, 58 (8): 1659-1670.   DOI: 10.3785/j.issn.1008-973X.2024.08.013
Abstract   HTML PDF (2970KB) ( 253 )  

A multi-agent parking simulation framework was constructed in order to formulate autonomous vehicle (AV) parking demand management strategies. Two charging strategies for empty-load driving were proposed: a static charge based on driving distance and a dynamic charge based on road congestion levels. Rate calculation method was analyzed. Cost functions for parking lots, residential parking, and continuous empty cruising were established under these charging policies. A logit model was used to describe the choice behavior under different parking modes. The simulation of urban mobility (SUMO) was used to conduct a large-scale road network simulation experiment in Nanning’s main urban area. AV parking behavior and road network operation under both strategies were analyzed. The simulation results showed that the empty-load driving mileage of AVs decreased by 20.16% and 10.85% under the static and dynamic charging strategies, respectively. Total vehicle delay decreased by 39.80% and 43.52%, respectively. The dynamic charging strategy was adjustable in real-time based on road conditions, and operational efficiency of the road network was significantly enhanced.

Table and Figures | Reference | Related Articles | Metrics
3D path planning of plant protection UAVs in hilly mountainous orchards
Shaomeng YU,Ming YAN,Pengfei WANG,Jianxi ZHU,Xin YANG
Journal of ZheJiang University (Engineering Science)    2025, 59 (3): 635-642.   DOI: 10.3785/j.issn.1008-973X.2025.03.021
Abstract   HTML PDF (2985KB) ( 250 )  

A full-coverage 3D path planning method for mountainous orchard plant protection UAVs was proposed to address the challenges of manual control and the lack of 3D path planning for plant protection drones operating in hilly orchards. 3D coordinates of the operation area obtained from a real scene 3D model of the area were utilized. Comprehensive 3D path planning for plant protection UAVs was carried out based on the reciprocating cattle farming method and the real scene 3D model of the hilly orchard. An energy consumption model for the UAV was constructed, considering its movement status and load changes. The operating heading angle (ranging from 1° to 180°) was optimized to determine the path with minimal energy consumption. Results of field experiments showed that the path with the minimal energy consumption (heading angle of 91°) reduced the total energy consumption by 20.88% and the time required to complete the plant protection operation by 16.31%, compared to the path with the maximum energy consumption (heading angle of 147°). The fluctuation in canopy droplet deposition at each sampling point within the operation area was minimal. This method not only optimizes the energy consumption and improves the operational efficiency, but also ensures full coverage of plant protection within the working area.

Table and Figures | Reference | Related Articles | Metrics
Optimization of 3D multi-UAVs low altitude penetration based on bald eagle search algorithm
Xialu WEN,He HUANG,Huifeng WANG,Lan YANG,Tao GAO
Journal of ZheJiang University (Engineering Science)    2024, 58 (10): 2020-2030.   DOI: 10.3785/j.issn.1008-973X.2024.10.005
Abstract   HTML PDF (2152KB) ( 460 )  

In response to the complex three-dimensional space environment and the high computational complexity of low altitude penetration path planning for multi-UAVs, the existing multi-objective bald eagle search algorithm has the shortcomings of easily approaching the center point and low accuracy. A 3D multi-UAVs low altitude penetration method based on the improved multi-objective bald eagle search algorithm (IMBES) was proposed. Models for the 3D environment, threat sources, UAV physical constraints, multi-UAVs cooperative constraints, and path smoothness were constructed to define a multi-objective cost function. A coupling chaotic mapping initialization was designed to enhance the quality of the initial population. An adaptive Gauss walk strategy based on the “scout eagle” was devised to balance development and search capabilities. Fast non-dominated sorting was introduced to further enhance algorithm efficiency. By leveraging the correspondence between the bald eagle position and UAV speed, turning angle, and climbing angle, the IMBES efficiently explored the UAV configuration space to identify the optimal Pareto front. Experimental results showed that the success rate of the IMBES was 70.5%. Compared with existing path planning methods, the proposed method demonstrates strong optimization capabilities and low energy consumption, making it suitable for collaborative low-altitude penetration by multiple UAVs.

Table and Figures | Reference | Related Articles | Metrics
Intelligent connected vehicle motion planning at unsignalized intersections based on deep reinforcement learning
Mingfang ZHANG,Jian MA,Nale ZHAO,Li WANG,Ying LIU
Journal of ZheJiang University (Engineering Science)    2024, 58 (9): 1923-1934.   DOI: 10.3785/j.issn.1008-973X.2024.09.017
Abstract   HTML PDF (2586KB) ( 256 )  

A vehicle motion planning algorithm based on deep reinforcement learning was proposed to satisfy the efficiency and comfort requirements of intelligent connected vehicles at unsignalized intersections. Temporal convolutional network (TCN) and Transformer algorithms were combined to construct the intention prediction model for surrounding vehicles. The multi-layer convolution and self-attention mechanisms were used to improve the capability of capturing vehicle motion feature. The twin delayed deep deterministic policy gradient (TD3) reinforcement learning algorithm was employed to build the vehicle motion planning model. Taking the driving intention of surrounding vehicle, driving style, interaction risk, and the comfort of ego vehicle into consideration comprehensively, the state space and reward functions were designed to enhance understanding the dynamic environment. Delaying the policy updates and smoothing the target policies were conducted to improve the stability of the proposed algorithm, and the desired acceleration was output in real-time. Experimental results demonstrated that the proposed motion planning algorithm can perceive the real-time potential interaction risk based on the driving intention of surrounding vehicles. The generated motion planning strategy met the requirements of the efficiency, safety and comfort. It showed excellent adaptability to different styles of surrounding vehicles and dense interaction scenarios, and the success rates exceeded 92.1% in various scenarios.

Table and Figures | Reference | Related Articles | Metrics
Survey of embodied agent in context of foundation model
Songyuan LI,Xiangwei ZHU,Xi LI
Journal of ZheJiang University (Engineering Science)    2025, 59 (2): 213-226.   DOI: 10.3785/j.issn.1008-973X.2025.02.001
Abstract   HTML PDF (841KB) ( 835 )  

Foundational models in natural language processing, computer vision and multimodal learning have achieved significant breakthroughs in recent years, showcasing the potential of general artificial intelligence. However, these models still fall short of human or animal intelligence in areas such as causal reasoning and understanding physical commonsense. This is because these models primarily rely on vast amounts of data and computational power, lacking direct interaction with and experiential learning from the real world. Many researchers are beginning to question whether merely scaling up model size is sufficient to address these fundamental issues. This has led the academic community to reevaluate the nature of intelligence, suggesting that intelligence arises not just from enhanced computational capabilities but from interactions with the environment. Embodied intelligence is gaining attention as it emphasizes that intelligent agents learn and adapt through direct interactions with the physical world, exhibiting characteristics closer to biological intelligence. A comprehensive survey of embodied artificial intelligence was provided in the context of foundational models. The underlying technical ideas, benchmarks, and applications of current embodied agents were discussed. A forward-looking analysis of future trends and challenges in embodied AI was offered.

Table and Figures | Reference | Related Articles | Metrics
Identification of apple leaf diseases based on MA-ConvNext network and stepwise relational knowledge distillation
Huan LIU,Yunhong LI,Leitao ZHANG,Yue GUO,Xueping SU,Yaolin ZHU,Lele HOU
Journal of ZheJiang University (Engineering Science)    2024, 58 (9): 1757-1767.   DOI: 10.3785/j.issn.1008-973X.2024.09.001
Abstract   HTML PDF (5637KB) ( 831 )  

The backgrounds are cluttered, the spot sizes of apple leaf disease are varying in complex environments, and the existing models have the problems of multiple parameters and a large amount of calculation. Thus, an apple leaf disease recognition network, ConvNext network based on attention and multiscale feature fusion (MA-ConvNext), was proposed. A multiscale spatial reconstruction and channel reconstruction block (MSCB) and a feature extraction block with triplet attention fusion (TAFB) were utilized to effectively extract the features at different scales and enhance the focus on leaf disease spots. Additionally, a stepwise relational knowledge distillation method was employed to fuse the "teacher" network (MA-ConvNext) with an "intermediate" network (DenseNet121) to guide the training of the "student" network (EfficientNet-B0) and achieve the model lightweighting. Experimental results showed that MA-ConvNext achieved a recognition accuracy of 99.38%, improving by 3.98 percentage points, 7.55 percentage points and 4.27 percentage points compared to ResNet50, MobileNet-V3, and EfficientNet-V2 networks, respectively. After the stepwise relational knowledge distillation, the recognition accuracy further improved by 1.76 percentage points, with a smaller network size and parameters of 1.56×107 and 5.29×106. respectively. The proposed method offers new insights and technical support for the precise detection of pests and diseases in agriculture.

Table and Figures | Reference | Related Articles | Metrics
Attention-fused filter bank dual-view graph convolution motor imagery EEG classification
Shuhan WU,Dan WANG,Yuanfang CHEN,Ziyu JIA,Yueqi ZHANG,Meng XU
Journal of ZheJiang University (Engineering Science)    2024, 58 (7): 1326-1335.   DOI: 10.3785/j.issn.1008-973X.2024.07.002
Abstract   HTML PDF (2102KB) ( 353 )  

In motor imagery tasks, the brain often involves simultaneous activation of multiple regions, and traditional convolutional neural networks struggle to accurately represent the coordinated neural activity across these regions. Graph convolutional network GCN is suitable for representing the collaborative tasks of different brain regions by considering the connections and relationships between nodes (brain regions) in graph data. Attention-fused filter bank dual-view GCN(AFB-DVGCN)was proposed. A dual-branch network was constructed using filter banks to extract temporal and spatial information from different frequency bands. Information complementarity was achieved by a convolutional spatial feature extraction method for dual-view graphs. In order to improve the classification accuracy, the effective channel attention mechanism was utilized to enhance features and capture the interaction information between different feature maps. Validation results in the publicly available datasets BCI Competition IV-2a and OpenBMI show that AFB-DVGCN has achieved good classification performance, and the classification accuracy is significantly higher than that of the comparison networks.

Table and Figures | Reference | Related Articles | Metrics
Super-resolution reconstruction of remote sensing image based on CNN and Transformer aggregation
Mingzhi HU,Jun SUN,Biao YANG,Kairong CHANG,Junlong YANG
Journal of ZheJiang University (Engineering Science)    2025, 59 (5): 938-946.   DOI: 10.3785/j.issn.1008-973X.2025.05.007
Abstract   HTML PDF (12404KB) ( 388 )  

A multi-layer degradation module was proposed aiming at the problem that most remote sensing image super-resolution models rarely consider the impact of noise, blur, JPEG compression, and other factors on image reconstruction, as well as the limitations of Transformer modules in capturing high-frequency information. A CNN-Transformer hybrid network was designed, where CNN captures high-frequency details and Transformer extracts global information. These two components were combined by an attention-based aggregation module, enhancing local high-frequency detail reconstruction while maintaining global structural coherence. The model was tested on six random scenes from the AID dataset and compared with the MM-realSR model in PSNR and SSIM. Results show an average PSNR improvement of 1.61 dB and a SSIM increase of 0.023 over MM-realSR.

Table and Figures | Reference | Related Articles | Metrics
Control strategy of power conversion system based on sliding mode active disturbance rejection control
Jinfeng HUANG,Jie ZHOU,Hongjie HUANG
Journal of ZheJiang University (Engineering Science)    2024, 58 (10): 2171-2181.   DOI: 10.3785/j.issn.1008-973X.2024.10.021
Abstract   HTML PDF (1993KB) ( 186 )  

In order to improve the dynamic performance of the power conversion system (PCS), an improved active disturbance rejection control (ADRC) strategy based on reduced-order cascaded extended state observer (ESO) and complementary sliding mode control (CSMC) was designed and applied to the voltage outer loop of the bidirectional DC/AC converter in the PCS. The ESO was modified to a reduced-order cascaded ESO to improve the estimation speed of the state variables and the overall disturbance, enhancing the disturbance estimation capability. The PD control was replaced with CSMC to design a state error feedback law to enhance the robustness of the system, and an improved exponential reaching law was designed to suppress the chattering phenomenon. A simulation model was established and a related experimental platform was built to demonstrate the superiority of the improved ADRC strategy compared to PI control and traditional ADRC. The simulation and experimental results show that the improved ADRC strategy reduces the fluctuation of the DC bus voltage during the transient operation of the PCS, improves the power response speed on the AC side of the PCS, and enhances the output power quality on the AC side.

Table and Figures | Reference | Related Articles | Metrics
Oriented ship detection algorithm in SAR image based on improved YOLOv5
Yali XUE,Yiming HE,Shan CUI,Quan OUYANG
Journal of ZheJiang University (Engineering Science)    2025, 59 (2): 261-268.   DOI: 10.3785/j.issn.1008-973X.2025.02.004
Abstract   HTML PDF (2761KB) ( 773 )  

A novel detection algorithm (efficient multi-scale attention (EMA) and small object detection based on YOLOv5, ES-YOLOv5) was proposed by targeting small ship targets in SAR scenes aiming at the issues of inconspicuous imaging features and low detection accuracy caused by arbitrary orientation of small targets in synthetic aperture radar (SAR) imaging. A small target detection layer was added to adjust the receptive field size, making it more suitable for capturing small target scale features and facilitating multi-scale fusion. An EMA mechanism was introduced to focus on key target information and enhance feature representation capability. The circular smooth label (CSL) technique was utilized to adapt to the periodicity of angles, achieving high-precision angle classification. The experimental results demonstrate that the proposed method achieves an average detection accuracy of 90.9% at an intersection over union (IoU) threshold of 0.5 on the RSDD-SAR dataset. The algorithm outperforms the baseline algorithm YOLOv5 by 6% in improving the precision of detecting small SAR ship targets, significantly enhancing the model’s detection performance.

Table and Figures | Reference | Related Articles | Metrics
Occluded human pose estimation network based on knowledge sharing
Jiahong JIANG,Nan XIA,Changwu LI,Xinmiao YU
Journal of ZheJiang University (Engineering Science)    2024, 58 (10): 2001-2010.   DOI: 10.3785/j.issn.1008-973X.2024.10.003
Abstract   HTML PDF (1801KB) ( 229 )  

A new estimation network was proposed for improving the insufficient occlusion handling ability of existing human pose estimation methods. An occluded parts enhanced convolutional network (OCNN) and an occluded features compensation graph convolutional network (OGCN) were included in the proposed network. A high-low order feature matching attention was designed to strengthen the occlusion area features, and high-adaptation weights were extracted by OCNN, achieving enhanced detection of the occluded parts with a small amount of occlusion data. OGCN strengthened the shared and private attribute compensation node features by eliminating the obstacle features. The adjacency matrix was importance-weighted to enhance the quality of the occlusion area features and to improve the detection accuracy. The proposed network achieved detection accuracy of 78.5%, 67.1%, and 77.8% in the datasets COCO2017, COCO-Wholebody, and CrowdPose, respectively, outperforming the comparative algorithms. The proposed network saved 75% of the training data usage in the self-built occlusion dataset.

Table and Figures | Reference | Related Articles | Metrics
EEG microstate functional network analysis of different emotional false memories
Yixuan LI,Ying LI,Qian XIAO,Lingyue WANG,Ning YIN,Shuo YANG
Journal of ZheJiang University (Engineering Science)    2025, 59 (1): 49-61.   DOI: 10.3785/j.issn.1008-973X.2025.01.005
Abstract   HTML PDF (5110KB) ( 152 )  

The research on the influence of emotions on false memory helps explore the memory-processing mechanisms of the brain. The EEG signals of false memories under different emotion states were collected. The microstate analysis was used to obtain the template maps for each emotion group named from microstate 1 to microstate 5, the time segmentation of the four stages of memory recognition (early processing, familiar processing, episodic recall processing and post-extraction processing) for the emotion groups were divided according to the microstate fitting results, and the phase-locked brain functional networks were constructed in microstates with significant difference in time coverage. The results analyzed of EEG signals from both the temporal perspective and the spatial perspective show that the brain processing patterns of the emotion groups begin to appear different from the episode recall processing stage. The positive group remains in the active microstates 3 and 5 of the prefrontal region and has strong brain function, the negative group remains in microstate 1 and has poor brain function, and the neutral group remains in the active microstates 3 and 4 of the central region. The positive group spends more time and mental resources on plot association and reasoning, while the negative group stays depressed for a longer time, and the neutral group devotes more time and mental resources to information integration.

Table and Figures | Reference | Related Articles | Metrics
Surface defect detection method for aluminum profiles based on improved YOLOv7-tiny
Junyin WANG,Bin WEN,Yanjun SHEN,Jun ZHANG,Zihao WANG
Journal of ZheJiang University (Engineering Science)    2025, 59 (3): 523-534.   DOI: 10.3785/j.issn.1008-973X.2025.03.010
Abstract   HTML PDF (4334KB) ( 224 )  

An improved YOLOv7-tiny detection algorithm was proposed to address the problems such as various types of surface defects in aluminum profiles, large differences in defect scales and missed detection of small target defects. The spatial pyramid pooling module was reconstructed by utilizing the residual structure, parameter-free attention mechanism (SimAM), activation function (FReLU) and clipping convolution to capture more detailed information and strengthen the multi-scale learning ability of the network. The optimized detection layer was used to obtain more small target features and location information, and improve the detection ability of network multi-scale defect. Partial convolution was introduced to replace the 3×3 convolution in the efficient layer aggregation network (ELAN), then the lightweight model was used to reduce the computing and training burden. Combined with the similarity of normalized Wasserstein distance (NWD) loss measurement, the network convergence was accelerated and the detection ability of small target defects was improved. Test was conducted on the Tianchi aluminium profile dataset, and the results showed that the improved YOLOv7-tiny algorithm achieved the accuracy, recall, mean average accuracy (mAP@0.5) and detection speed of 95.0%, 91.8%, 94.5% and 45 frames per second, respectively, when the confidence threshold was 0.25. Compared with the original algorithm, the mAP@0.5 of the improved algorithm was increased by 4.2 percentage point as a whole, the average accuracy (AP) of the dirty spot defect was increased by 13.1 percentage point; the detection results of the improved algorithm for low-resolution images and interfered images was better than of the original algorithm, which showed that the proposed method had better generalization and anti-interference ability.

Table and Figures | Reference | Related Articles | Metrics
Tomato leaf disease detection based on improved CenterNet algorithm
Ya LI,Chen JIANG,Hairui WANG,Guifu ZHU,Can HU
Journal of ZheJiang University (Engineering Science)    2025, 59 (11): 2370-2378.   DOI: 10.3785/j.issn.1008-973X.2025.11.016
Abstract   HTML PDF (3346KB) ( 473 )  

A tomato leaf disease detection model based on the improved CenterNet algorithm was proposed in order to address the false detection and missed detection phenomena in traditional tomato leaf disease detection. A feature fusion module that integrated the attention mechanism was constructed in order to enhance the model's cross-scale feature fusion capability. The multi-branch convolutional module RFB was added to the backbone network in order to expand the receptive field and enhance the ability to extract target features. The pyramid convolution PyConv was introduced into the backbone network to enhance the extraction of multi-scale features by calculating receptive fields of different scales and reduce information loss. Pruning optimization strategies were designed in order to reduce the impact of introducing modules on the number of model parameters and computational load. The test results showed that the accuracy rate, recall rate, mAP50 and mAP50:95 of the improved model reached 96.3%, 80.2%, 91.4% and 78.7% respectively. The proposed model can effectively improve the accuracy of tomato leaf disease detection, and the model has good generalization.

Table and Figures | Reference | Related Articles | Metrics