Most Downloaded Articles

Published in last 1 year | In last 2 years| In last 3 years| All| Most Downloaded in Recent Month| Most Downloaded in Recent Year|

Most Downloaded in Recent Month
Please wait a minute...
Survey of deep learning based EEG data analysis technology
Bo ZHONG,Pengfei WANG,Yiqiao WANG,Xiaoling WANG
Journal of ZheJiang University (Engineering Science)    2024, 58 (5): 879-890.   DOI: 10.3785/j.issn.1008-973X.2024.05.001
Abstract   HTML PDF (690KB) ( 1900 )  

A thorough analysis and cross-comparison of recent relevant works was provided, outlining a closed-loop process for EEG data analysis based on deep learning. EEG data were introduced, and the application of deep learning in three key stages: preprocessing, feature extraction, and model generalization was unfolded. The research ideas and solutions provided by deep learning algorithms in the respective stages were delineated, including the challenges and issues encountered at each stage. The main contributions and limitations of different algorithms were comprehensively summarized. The challenges faced and future directions of deep learning technology in handling EEG data at each stage were discussed.

Table and Figures | Reference | Related Articles | Metrics
Driver fatigue state detection method based on multi-feature fusion
Hao-jie FANG,Hong-zhao DONG,Shao-xuan LIN,Jian-yu LUO,Yong FANG
Journal of ZheJiang University (Engineering Science)    2023, 57 (7): 1287-1296.   DOI: 10.3785/j.issn.1008-973X.2023.07.003
Abstract   HTML PDF (1481KB) ( 791 )  

The improved YOLOv5 object detection algorithm was used to detect the facial region of the driver and a multi-feature fusion fatigue state detection method was established aiming at the problem that existing fatigue state detection method cannot be applied to drivers under the epidemic prevention and control. The image tag data including the situation of wearing a mask and the situation without wearing a mask were established according to the characteristics of bus driving. The detection accuracy of eyes, mouth and face regions was improved by increasing the feature sampling times of YOLOv5 model. The BiFPN network structure was used to retain multi-scale feature information, which makes the prediction network more sensitive to targets of different sizes and improves the detection ability of the overall model. A parameter compensation mechanism was proposed combined with face keypoint algorithm in order to improve the accuracy of blink and yawn frame number. A variety of fatigue parameters were fused and normalized to conduct fatigue classification. The results of the public dataset NTHU and the self-made dataset show that the proposed method can recognize the blink and yawn of drivers both with and without masks, and can accurately judge the fatigue state of drivers.

Table and Figures | Reference | Related Articles | Metrics
Survey of embodied agent in context of foundation model
Songyuan LI,Xiangwei ZHU,Xi LI
Journal of ZheJiang University (Engineering Science)    2025, 59 (2): 213-226.   DOI: 10.3785/j.issn.1008-973X.2025.02.001
Abstract   HTML PDF (841KB) ( 208 )  

Foundational models in natural language processing, computer vision and multimodal learning have achieved significant breakthroughs in recent years, showcasing the potential of general artificial intelligence. However, these models still fall short of human or animal intelligence in areas such as causal reasoning and understanding physical commonsense. This is because these models primarily rely on vast amounts of data and computational power, lacking direct interaction with and experiential learning from the real world. Many researchers are beginning to question whether merely scaling up model size is sufficient to address these fundamental issues. This has led the academic community to reevaluate the nature of intelligence, suggesting that intelligence arises not just from enhanced computational capabilities but from interactions with the environment. Embodied intelligence is gaining attention as it emphasizes that intelligent agents learn and adapt through direct interactions with the physical world, exhibiting characteristics closer to biological intelligence. A comprehensive survey of embodied artificial intelligence was provided in the context of foundational models. The underlying technical ideas, benchmarks, and applications of current embodied agents were discussed. A forward-looking analysis of future trends and challenges in embodied AI was offered.

Table and Figures | Reference | Related Articles | Metrics
Research overview on touchdown detection methods for footed robots
Xiaoyong JIANG,Kaijian YING,Qiwei WU,Xuan WEI
Journal of ZheJiang University (Engineering Science)    2024, 58 (2): 334-348.   DOI: 10.3785/j.issn.1008-973X.2024.02.012
Abstract   HTML PDF (1751KB) ( 802 )  

The effects of leg structure design, foot-end design and sensor design on touchdown detection were comprehensively discussed by analyzing the existing legged robot touchdown detection methods. The touchdown method for direct detection of external sensors, the touchdown detection method based on kinematics and dynamics, and the touchdown detection method based on learning were summarized. Touchdown detection methods were summarized in three special scenarios: slippery ground, soft ground, and non-foot-end contact. The application scenarios of touchdown detection technology were analyzed, including the three application scenarios of motion control requirements, navigation applications, and terrain and geological sensing. The development trends were pointed out, which related to the four major touchdown detection methods of hardware improvement and integration, multi-mode touchdown detection, multi-sensor fusion touchdown detection, and intelligent touchdown detection. The specific relationships between various touchdown detection algorithms were summarized, which provided guidance for the development of follow-up technology for touchdown detection and specific applications of touchdown detection.

Table and Figures | Reference | Related Articles | Metrics
Research progress of porous materials with low dielectric constant
WANG Jia-Bang, ZHANG Guo-Quan
J4    2009, 43 (5): 957-961.   DOI: 10.3785/j.issn.1008-973X.2009.05.033
Abstract   PDF (699KB) ( 1787 )  

The porous materials with low dielectric constant are suitable for the applications in integrated circuits. From the aspects of composition and structure, preparation method and dielectric properties, this work introduced the porous low-dielectric-constant materials with different matrix such as inorganic materials, organic materials, inorganic and organic composite separately, whose dielectric constants can be reduced to 1.99, 1.50, 1.99, respectively. The using temperature of the porous low-dielectric-constant materials with organic matrix can reach 450 ℃. The flexural strength of the porous low-dielectric-constant materials with inorganic matrix can reach 136 MPa. The introduction of cave into the materials leads to the decrease of mechanical properties and the increase of dielectric loss. The effort to get a low-dielectric-constant and improve the above properties can broaden the application scope of the porous materials.

Reference | Related Articles | Metrics
Multi-agent pursuit and evasion games based on improved reinforcement learning
Ya-li XUE,Jin-ze YE,Han-yan LI
Journal of ZheJiang University (Engineering Science)    2023, 57 (8): 1479-1486.   DOI: 10.3785/j.issn.1008-973X.2023.08.001
Abstract   HTML PDF (1158KB) ( 939 )  

A multi-agent reinforcement learning algorithm based on priority experience replay and decomposed reward function was proposed in multi-agent pursuit and evasion games. Firstly, multi-agent twin delayed deep deterministic policygradient algorithm (MATD3) algorithm based on multi-agent deep deterministic policy gradient algorithm (MADDPG) and twin delayed deep deterministic policy gradient algorithm (TD3) was proposed. Secondly, the priority experience replay was proposed to determine the priority of experience and sample the experience with high reward, aiming at the problem that the reward function is almost sparse in the multi-agent pursuit and evasion problem. In addition, a decomposed reward function was designed to divide multi-agent rewards into individual rewards and joint rewards to maximize the global and local rewards. Finally, a simulation experiment was designed based on DEPER-MATD3. Comparison with other algorithms showed that DEPER-MATD3 algorithm solved the over-estimation problem, and the time consumption was improved compared with MATD3 algorithm. In the decomposed reward function environment, the global mean rewards of the pursuers were improved, and the pursuers had a greater probability of chasing the evader.

Table and Figures | Reference | Related Articles | Metrics
Multi-behavior aware service recommendation based on hypergraph graph convolution neural network
Jia-wei LU,Duan-ni LI,Ce-ce WANG,Jun XU,Gang XIAO
Journal of ZheJiang University (Engineering Science)    2023, 57 (10): 1977-1986.   DOI: 10.3785/j.issn.1008-973X.2023.10.007
Abstract   HTML PDF (1380KB) ( 421 )  

A multi-behavior aware service recommendation method based on hypergraph graph convolutional neural network (MBSRHGNN) was proposed to resolve the problem of insufficient high-order service feature extraction in existing service recommendation methods. A multi-hypergraph was constructed according to user-service interaction types and service mashups. A dual-channel hypergraph convolutional network was designed based on the spectral decomposition theory with functional and structural properties of multi-hypergraph. Chebyshev polynomial was used to approximate hypergraph convolution kernel to reduce computational complexity. Self-attention mechanism and multi-behavior recommendation methods were combined to measure the importance difference between multi-behavior interactions during the hypergraph convolution process. A hypergraph pooling method named HG-DiffPool was proposed to reduce the feature dimensionality. The probability distribution for recommending different services was learned by integrating service embedding vector and hypergraph signals. Real service data was obtained by the crawler and used to construct datasets with different sparsity for experiments. Experimental results showed that the MBSRHGNN method could adapt to recommendation scenario with highly sparse data, and was superior to the existing baseline methods in accuracy and relevance.

Table and Figures | Reference | Related Articles | Metrics
Multi-target tracking of vehicles based on optimized DeepSort
Li-sheng JIN,Qiang HUA,Bai-cang GUO,Xian-yi XIE,Fu-gang YAN,Bo-tao WU
Journal of ZheJiang University (Engineering Science)    2021, 55 (6): 1056-1064.   DOI: 10.3785/j.issn.1008.973X.2021.06.005
Abstract   HTML PDF (1014KB) ( 1244 )  

A front multi-vehicle target tracking algorithm optimized by DeepSort was proposed in order to improve the awareness of autonomous vehicles to the surrounding environment. Gaussian YOLO v3 model was adopted as the front-end target detector, and training was based on DarkNet-53 backbone network. Gaussian YOLO v3-Vehicle, a detector specially designed for vehicles was obtained, which improved the vehicle detection accuracy by 3%. The augmented VeRi data set was proposed to conduct the re-recognition pre-training in order to overcome the shortcomings that the traditional pre-training model doesn't target vehicles. A new loss function combining the central loss function and the cross entropy loss function was proposed, which can make the target features extracted by the network become better in-class aggregation and inter-class resolution. Actual road videos in different environments were collected in the test part, and CLEAR MOT evaluation index was used for performance evaluation. Results showed a 1% increase in tracking accuracy and a 4% reduction in identity switching times compared with the benchmark DeepSort YOLO v3.

Table and Figures | Reference | Related Articles | Metrics
Review of underground pipeline monitoring research based on distributed fiber optic sensing
Hai-ying WU,Hong-hu ZHU,Bao ZHU,He QI
Journal of ZheJiang University (Engineering Science)    2019, 53 (6): 1057-1070.   DOI: 10.3785/j.issn.1008-973X.2019.06.005
Abstract   HTML PDF (1057KB) ( 1375 )  

Outline the important role of underground pipelines in national economy and defense construction, as well as the possible serious consequences of pipeline failure. Point out that the real-time monitoring of underground pipelines by using distributed fiber optic sensing (DFOS) technology can guarantee the structural health and safe operation of pipelines. Introduce the pipeline monitoring principle based on DFOS technology, and the research progress of DFOS technology in pipeline leakage monitoring, third party intrusion monitoring, deformation monitoring, corrosion monitoring, geological and natural disaster monitoring and submarine pipeline monitoring. Analyze some existing problems and hot topics in the current research, as well as the future research trend.

Table and Figures | Reference | Related Articles | Metrics
Target tracking algorithm based on dynamic position encoding and attention enhancement
Changzhen XIONG,Chuanxi GUO,Cong WANG
Journal of ZheJiang University (Engineering Science)    2024, 58 (12): 2427-2437.   DOI: 10.3785/j.issn.1008-973X.2024.12.002
Abstract   HTML PDF (1684KB) ( 231 )  

A method based on dynamic position encoding and multi-domain attention feature enhancement was proposed to fully exploit the positional information between the template and search region and harness the feature representation capabilities. Firstly, a position encoding module with convolutional operations was embedded within the attention module. Position encoding was updated with attention calculations to enhance the utilization of spatial structural information. Next, a multi-domain attention enhancement module was introduced. Sampling was conducted in the spatial dimension using parallel convolutions with different dilation rates and strides to cope with targets of different sizes and aggregate the enhanced channel attention features. Finally, a spatial domain attention enhancement module was incorporated into the decoder to provide accurate classification and regression features for the prediction head. The proposed algorithm achieved an average overlap (AO) of 73.9% on the GOT-10K dataset. It attained area under the curve (AUC) scores of 82.7%, 69.3%, and 70.9% on the TrackingNet, UAV123, and OTB100 datasets, respectively. Comparative results with state-of-the-art algorithms demonstrated that the tracking model, which integrated dynamic position encoding as well as channel and spatial attention enhancement, effectively enhanced the interaction of information between the template and search region, leading to improved tracking accuracy.

Table and Figures | Reference | Related Articles | Metrics
Survey on program representation learning
Jun-chi MA,Xiao-xin DI,Zong-tao DUAN,Lei TANG
Journal of ZheJiang University (Engineering Science)    2023, 57 (1): 155-169.   DOI: 10.3785/j.issn.1008-973X.2023.01.016
Abstract   HTML PDF (1100KB) ( 695 )  

There has been a trend of intelligent development using artificial intelligence technology in order to improve the efficiency of software development. It is important to understand program semantics to support intelligent development. A series of research work on program representation learning has emerged to solve the problem. Program representation learning can automatically learn useful features from programs and represent the features as low-dimensional dense vectors in order to efficiently extract program semantic and apply it to corresponding downstream tasks. A comprehensive review to categorize and analyze existing research work of program representation learning was provided. The mainstream models for program representation learning were introduced, including the frameworks based on graph structure and token sequence. Then the applications of program representation learning technology in defect detection, defect localization, code completion and other tasks were described. The common toolsets and benchmarks for program representation learning were summarized. The challenges for program representation learning in the future were analyzed.

Table and Figures | Reference | Related Articles | Metrics
Three-dimensional target inversion algorithm based on multi-feature reconstruction
Yali XUE,Lizun ZHOU,Linfei WANG,Quan OUYANG
Journal of ZheJiang University (Engineering Science)    2024, 58 (11): 2199-2207.   DOI: 10.3785/j.issn.1008-973X.2024.11.001
Abstract   HTML PDF (2281KB) ( 321 )  

A 3D target inversion algorithm based on multi-feature reconstruction was proposed in order to solve the problems of large memory occupation and time-consuming training in deep learning-based three-dimensional inversion methods. Four types of features, horizontal area, center depth, vertical thickness and residual density of the target were extracted by feature decomposition to realize the compression of the three-dimensional model and reduce the memory occupation. The multi-feature reconstruction of inversion network (MRNet) was designed to realize the prediction of the four types of target features by different Decoder, and the four types of features were used to reconstruct the three-dimensional model to realize the inversion of the 3D target. The gradient union was introduced at the input of the network to realize the enhancement of target boundary information. The CA attention mechanism was introduced at the cross-layer connection to realize the differentiation of Decoder’s prediction function and optimize the inversion effect. The simulation results showed that the local relative accuracy of MRNet was improved by more than 30% compared with 3D U-Net, reaching 88.91%, and the training time per round was only 1/13 of 3D U-Net. MRNet was applied to Vinton Salt Mound, and the distribution of caprocks was obtained more accurately, which verified that MRNet had certain generalizability.

Table and Figures | Reference | Related Articles | Metrics
Choice of innovation type for China's industrial green transformation under environmental regulation
Haiying LIU,Xianzhe CAI
Journal of ZheJiang University (Engineering Science)    2024, 58 (1): 188-196.   DOI: 10.3785/j.issn.1008-973X.2024.01.020
Abstract   HTML PDF (707KB) ( 429 )  

A super-efficient SBM model including non-desired outputs was used to measure industrial environmental efficiency in 30 Chinese provinces from 2008 to 2020 in order to solve the problem of how industrial enterprises can pick appropriate green technology innovations to accomplish industrial green transformation under the background of strict environmental regulations. The efficiency was used to characterize the level of industrial green transformation. A panel threshold model was used to explore the mechanism of the impact of different green technology innovations on industrial green transformation under different environmental regulation intensities. Results show that China's industrial environmental efficiency fluctuates and rises from 2008 to 2020 as a whole, and the efficiency gap between regions shows a slightly decreasing trend. The environmental impacts of various green technology innovations significantly differ, among which process-oriented green technology innovations emphasizing on processes and products is the key to achieving industrial green transformation. The positive environmental effect of process-oriented green technology innovation increases, while the negative environmental effect of result-oriented green technology innovation decreases as environmental regulations become more stringent.

Table and Figures | Reference | Related Articles | Metrics
LIU Yun-Hai, LIN Yu-
null    2009, 43 (4): 710-715+742.  
Abstract   PDF (1046KB) ( 1728 )  
Related Articles | Metrics
Review of Chinese font style transfer research based on deep learning
Ruo-ran CHENG,Xiao-li ZHAO,Hao-jun ZHOU,Han-chen YE
Journal of ZheJiang University (Engineering Science)    2022, 56 (3): 510-519, 530.   DOI: 10.3785/j.issn.1008-973X.2022.03.010
Abstract   HTML PDF (874KB) ( 755 )  

The research works of Chinese font style transfer were classified according to different stages of research development. The traditional methods were briefly reviewed and the deep learning-based methods were combed and analyzed. The commonly used open data sets and evaluation criteria were introduced. The future research trends were expected from four aspects, which were to improve the generation quality, enhance personalized differences, reduce the number of training samples, and learn calligraphy font style.

Table and Figures | Reference | Related Articles | Metrics
Dead band effect and compensation for return-free power control of dual active bridge
Guopeng ZHANG,Chuangchuang JIANG,Haijun TAO,Zhuo CHEN
Journal of ZheJiang University (Engineering Science)    2024, 58 (11): 2406-2416.   DOI: 10.3785/j.issn.1008-973X.2024.11.022
Abstract   HTML PDF (3254KB) ( 136 )  

A dead band compensation strategy for CTPS control was proposed aiming at the problem that the occurrence of return power as well as the failure of soft switching was caused after the addition of bridge arm dead band to the triple phase-shift cooperative control (CTPS) of dual active bridge (DAB) converter. The coupling relationships between different mode shift ratios and the power transfer model and the switching conditions of CTPS control modes were corrected based on the principle of return power generation by analyzing the changes of transformer primary and secondary side voltages and leakage currents caused by the dead band of the bridge arm in different modes of CTPS control. Then the effective control of the impact of dead band on CTPS control was realized. The proposed compensation scheme suppressed the return power caused by the dead band, restored the soft-switching performance of the CTPS control, and had better current stress than before compensation. Experiments before and after dead band compensation were conducted separately to verify the analysis of the dead band effects and the proposed compensation strategy.

Table and Figures | Reference | Related Articles | Metrics
Review of data-driven intelligent computation and its application
Rui DAI,Jing JIE,Wanliang WANG,Qianlin YE,Fei WU
Journal of ZheJiang University (Engineering Science)    2025, 59 (2): 227-248.   DOI: 10.3785/j.issn.1008-973X.2025.02.002
Abstract   HTML PDF (1476KB) ( 75 )  

State-of-the-art data-driven intelligent computations (DDICs) were comprehensively reviewed in order to effectively solve the increasingly complex and expensive optimization problems (EOPs) emerging in real-world applications, which can effectively reduce computing costs and improve solutions. The latest research achievements of DDICs were outlined from both algorithm and application perspectives. Various technical points in generalized DDICs and adaptive DDICs were summarized and categorized. The challenges and opportunities faced by DDICs in solving EOPs were analyzed. Future research potential trends were proposed, such as conducting deeper theoretical analyses, exploring novel learning paradigms, applying these methods in various practical fields, and so on. This aims to provide targeted references and directions for researchers, stimulating innovative ideas to more effectively address the complex EOPs encountered in real-world applications.

Table and Figures | Reference | Related Articles | Metrics
Open-set 3D model retrieval algorithm based on multi-modal fusion
Fuxin MAO,Xu YANG,Jiaqiang CHENG,Tao PENG
Journal of ZheJiang University (Engineering Science)    2024, 58 (1): 61-70.   DOI: 10.3785/j.issn.1008-973X.2024.01.007
Abstract   HTML PDF (993KB) ( 404 )  

An open domain 3D model retrieval algorithm was proposed in order to meet the requirement of management and retrieval of massive new model data under the open domain. The semantic consistency of multi-modal information can be effectively used. The category information among unknown samples was explored with the help of unsupervised algorithm. Then the unknown class information was introduced into the parameter optimization process of the network model. The network model has better characterization and retrieval performance in the open domain condition. A hierarchical multi-modal information fusion model based on a Transformer structure was proposed, which could effectively remove the redundant information among the modalities and obtain a more robust model representation vector. Experiments were conducted on the dataset ModelNet40, and the experiments were compared with other typical algorithms. The proposed method outperformed all comparative methods in terms of mAP metrics, which verified the effectiveness of the method in terms of retrieval performance improvement.

Table and Figures | Reference | Related Articles | Metrics
Predefined time adaptive sliding mode control for flexible space robot
Yicheng LIU,Jialing YANG,Rui TANG,Jing CHENG
Journal of ZheJiang University (Engineering Science)    2025, 59 (2): 351-361.   DOI: 10.3785/j.issn.1008-973X.2025.02.013
Abstract   HTML PDF (2919KB) ( 70 )  

An adaptive sliding mode control method based on predefined time was proposed for the trajectory tracking control problem of a flexible space robot with typical nonlinear characteristics. The dynamic model of the multi-stage cable-driven flexible space robot was established by using the constant curvature method and Lagrangian formulation. A sliding mode controller based on predefined time theory was designed. A radial basis function (RBF) neural network was employed to compensate for modeling errors and external disturbances in the multi-stage cable-driven flexible space robot system. The convergence of trajectory tracking error within predefined time was proven using Lyapunov theory. The effectiveness of the model and controller was verified through numerical simulations. Comparative analysis against fixed-time controllers and uncompensated controllers showed that the proposed controller facilitated faster convergence of system trajectory error.

Table and Figures | Reference | Related Articles | Metrics
Three-dimensional sector automatic design based on improved NSGA-II algorithm
Yingfei ZHANG,Xiaobing HU,Hang ZHOU,Xuzeng FENG
Journal of ZheJiang University (Engineering Science)    2025, 59 (2): 413-422.   DOI: 10.3785/j.issn.1008-973X.2025.02.019
Abstract   HTML PDF (1634KB) ( 70 )  

An improved non-dominated sorting genetic algorithm II (NSGA-II) was proposed in order to address the challenges of time-consuming manual airspace sectorization and the difficulty in comparing the quality of different sectorization schemes. A three-dimensional multi-objective optimization model for sectorization was established by using a grid-region-sector hierarchy in order to balance controllers’ workload within sectors and reduce workload differences between sectors. A fitness evaluation operator, a probability-adaptive combination crossover operator and a dynamic mutation operator were incorporated in the NSGA-II algorithm in order to enhance the number of feasible solutions, solution diversity and computational efficiency. A simulation was conducted for the automatic 3D sectorization of Xi'an high-altitude airspace. Results showed that the optimized scheme improved workload balance within sectors by 37% and reduced inter-sector workload by 24% compared with the current sectorization configuration. The proposed improved NSGA-II provided a broader range of options for decision-makers with varying preferences compared with traditional weighted multi-objective optimization algorithms.

Table and Figures | Reference | Related Articles | Metrics