2025年06月

Bao Shengqi, graduated from Shanghai University as an undergraduate and started his academic master’s degree in the School of Computer Engineering and Science at Shanghai University in 2022. Bao Shengqi has been studying in Yuexing Han’s research group since his senior year of undergraduate study to research on image processing related technologies and applications, and has continued and advanced the following research under the careful guidance of Mr Han:

  1. A lightweight deep learning method based on unimodal feature decoupling is proposed to solve the task of predicting the authenticity of calcite PUF anti-counterfeiting labels in view of the complexity of multilevel features in functional material images. The feature decoupling strategy separates the learning of macroscopic and microscopic features into independent stages, which reduces the interference between different layers of features. To meet the high efficiency and low latency requirements, a lightweight divergent convolution network is designed, whose core divergent convolution mechanism significantly reduces the computational complexity. At the same time, the extra angle boundary loss function is used to enhance the discriminative ability of the features by increasing the inter-class distance and reducing the intra-class differences. In order to achieve effective rejection of unknown samples, a geodesic metric is proposed to predict the sample authenticity based on the geodesic distance in high-dimensional manifold space.

  2. Aiming at the problem of difficult fusion of multimodal heterogeneous data features of functional materials, a lightweight deep learning method based on multimodal feature fusion is proposed to solve the task of predicting the properties of calcium alginate/graphene composites. By designing a table-driven feature fusion network, two heterogeneous data sources, namely image and table, are effectively integrated. The network contains two core modules: form-guided visual semantic enhancement module and gated feature fusion module. The table-guided visual semantic enhancement module uses table information to guide visual feature learning for deep cross-modal alignment and enhancement; the gated feature fusion module performs effective inter-modal interaction, alignment and adaptive weighted fusion through cross-attention and gating mechanisms. Meanwhile, to ensure computational efficiency, the network adopts lightweight design such as shunt convolution and Mamba.

During his three years of postgraduate study at Shanghai University, Bao Shengqi has worked hard to enhance his professional knowledge, and has made many good friends and mentors. We hope that Bao Shengqi will not forget his original intention and keep his mission in mind in the future, and that he will overcome the obstacles and move forward.

Essay: Research on Feature Processing and Prediction Methods for Functional Material Data

Read

Ruan Liheng graduated from Shanghai University with a bachelor’s degree and began his master’s degree in September 2022 at the School of Computer Engineering and Science, Shanghai University. After joining the research group, he studied shape space theory and image generation technologies and applications under the guidance of Professor Han Yuexing. Under Professor Han’s careful guidance, he completed the following research:

  1. An image generation method based on the migration of information from pre-shaped spatial geodesic surfaces is proposed to address the challenges faced by image generation models when there are too few training samples and a lack of applicable pre-training models. The method overcomes the bottleneck of the model’s difficulty in effectively learning the distribution of very few samples and aims to generate high-quality and diverse images. The core process is as follows: firstly, the depth features of a small number of samples are extracted and used to construct a geodesic surface in pre-shape space for nonlinear feature enhancement; then, a pseudo-source domain is constructed based on the enhanced features to simulate a rich data distribution, and information migration from the pseudo-source domain to the target domain is carried out; finally, interpolated supervisory and regularisation constraints are imposed at the information migration stage for optimisation. Experiments demonstrate that, compared with existing methods, this method significantly improves the quality, detail richness and diversity of the generated images on multi-domain datasets, effectively mitigates pattern collapse, and demonstrates the potential of its generated images in assisting downstream tasks.

  2. To address the challenges of the text-guided zero-sample image style migration task, a zero-sample style migration method based on the enhancement of geodesic surface features in the pre-shape space is proposed. The method aims to efficiently inject external novel style information into the pre-trained model while ensuring style consistency and content accuracy. Specifically, the method applies the idea of geodesic feature enhancement to the style migration framework based on the pre-trained diffusion model, combines with sliding window cropping to process the local information, and facilitates the effective fusion of the style and content features in the pre-shape space using the geodesic feature enhancement module. Experiments show that the method can achieve flexible text-guided style control without additional model fine-tuning or style references, and can better maintain the original content structure when generating images with the target style compared to the comparison model.

After graduation, Ruan Liheng will join Huawei. Looking back on his three years in Shanghai University, he has studied hard, conducted serious research, continued to improve his professional ability, and also formed a deep friendship with many teachers and friends. We hope that Ruan Liheng will take this experience and gain, not forgetting his original intention, and will ride the waves of the wind and have a bright future in the future.

Essay: Research and Application of Few-Shot Image Generation Method with Feature Augmentation Based on The Shape Space Theory

Code: https://github.com/P2i42/FAGStyle

Read

Zhang Yilin graduated from Heilongjiang University with a bachelor’s degree. In 2022, she began her master’s degree in professional studies at the School of Computer Engineering and Science, Shanghai University. After joining the research group, she followed the guidance of Professors Zhang Rui, Han Yuexing, and Chen Qiaochuan to study material literature information mining methods. Under the careful guidance of her teachers, she completed the following research:

  1. Aiming at the problems of long sequence dependency and complex entity relationships in materials literature, a semantically enhanced graph network model is proposed and applied to the literature mining field of composite materials. The model enhances semantic association modelling by constructing a heterogeneous graph and introduces a chunked attention mechanism to efficiently deal with the long sequence problem, effectively overcoming the limitations of traditional models. On this basis, the model uses deep separable convolution to fuse global and local semantic features, and combines the dynamic edge weighting mechanism and deep scoring network to improve the node representation and recognition accuracy, and more effectively captures the semantic relationships of material terms in complex contexts.

  2. Aiming at the problems of fuzzy entity boundaries and poor recognition of long entities in general-purpose material texts, a multi-granularity fusion graph network model is proposed to be applied to the task of named entity recognition in the field of material science literature. The model is designed with a new module that fuses multi-granularity semantic features with boundary optimisation strategies: firstly, it enhances the representation ability of semantic features at different scales through gated fusion and cross-granularity interactive attention; secondly, it combines Conditional Random Fields and Contrastive Learning for joint training to synergistically improve the accuracy of boundary recognition and the performance of long entity recognition by taking advantage of their respective strengths.

  3. The proposed literature mining method is applied to the prediction of carbon fibre composite material properties and application design. By mining and screening the material experimental literature, nine types of key features closely related to mechanical properties are extracted, and the potential of the literature mining results in performance modelling is experimentally verified. In addition, a material performance prediction system was designed and implemented to support users to upload data files and complete model selection, training and result visualisation, providing an efficient and easy-to-use performance prediction tool for material researchers.

Zhang Yilin joined Alibaba Group after graduation and worked as a software developer. During her postgraduate studies at Shanghai University, she studied hard to enhance her professional knowledge and research ability, and was fortunate to meet many mentors and friends. We hope that Zhang Yilin will not forget her original intention and keep her mission in mind in the future, and that she will overcome the obstacles and move forward.

Essay: Research on Material Literature Mining Methods Based on Semantic Awareness

Code: https://github.com/han-yuexing/2025-thesis-zyl-code

Read

Wang Hui graduated from Yanbian University and began her master’s degree in the School of Computer Engineering and Science at Shanghai University in September 2022. After joining the research group, she studied natural language processing and related technologies and applications under the guidance of Professor Han Yuexing, completing the following research:

  1. In order to exploit the potential of large language models for entity extraction in scientific literature scenarios, a context-consistent explicit entity annotation method with a two-stage training approach is proposed to address the discrepancy between the generative output of the large language model and the sequential annotation characteristics of the named entity recognition task. Then, the training phase is divided into two phases: supervised fine-tuning and direct preference optimisation. The supervised fine-tuning phase first learns the basic entity recognition ability on the existing annotation data; the direct preference optimisation phase tries to expand and contract the entity boundaries in negative sample construction in order to guide the model to correct the errors more efficiently, and screens the reasoning results of supervised fine-tuning to create category confusing samples, and enhances the model to respond to the errors by utilising the positive and negative samples pairs of preference difference to constrain and enhance the model’s ability to correct erroneous determinations.

  2. In order to solve the problem of insufficient recognition accuracy of named entities due to a large number of low-frequency terminology when general-purpose models deal with highly specialised domains such as materials science and biomedicine, this paper proposes a semantic fusion method based on domain language models, which enhances the deeper semantic understanding of scientific literature by semantically fusing different domain language models and domain word-level vectors, and verifies the effectiveness of the method on the complex specialised texts in materials science and biomedicine fields by experiments. The effectiveness of the method is verified experimentally for complex specialised texts in the fields of materials science and biomedicine. Finally, the method is applied to specific fields and three kinds of high hardness alloys are designed to show its practical value in scientific text mining and assisting R&D decision-making.

After graduation, Wang Hui joined Vivo Mobile Communications. During her three-year postgraduate career at Shanghai University, Hui Wang studied hard and participated in research projects. She was able to quickly analyse and propose effective solutions to complex problems, showing strong independent research ability and innovative consciousness. We hope that Hui Wang will not forget her original intention, overcome the obstacles and move forward in the future.

Essay: Research and Applications of Named Entity Recognition for Scientific Literature

Code: https://github.com/han-yuexing/2025-thesis-wh-code

Read

Our team published the paper “A projection module based on the shape space theory for small-sample image processing”. The School of Computer Engineering and Science of Shanghai University is the first unit of the paper, Hu Gan is the first author, and Han Yuexing is the corresponding author.

The “pre-training” plus “fine-tuning” paradigm provides an effective tool for neural network image processing with limited datasets. This approach compensates for the lack of information in small target datasets by pre-training models on large source datasets. However, when the target dataset is further reduced to a small sample size, the existing method is difficult to maintain the performance of the migration model, resulting in a sharp deterioration of the effect. To overcome this shortcoming, this paper proposes PMSS, a projection module based on shape space theory, to enhance the capability of migration models in small sample scenarios.

We first pre-train the model on the source dataset and save it, and subsequently extract features from the target dataset using the pre-trained model. These features, which are originally in Euclidean space, are projected to the pre-shape space for subsequent training via PMSS. In addition, a class-aware attention mechanism is introduced during the learning process to enhance the model’s ability to handle small-sample tasks through enhanced feature representation. Extensive experiments on 10 backbone networks and 5 datasets demonstrate the effectiveness of PMSS: 6%, 8%, and 13% accuracy improvements are achieved on CIFAR10, CIFAR100, and their small-sample subsets, respectively.PMSS adopts a plug-and-play design, which allows it to be directly applied to real-world finite-data systems without the need to change the network architecture. PMSS achieves state-of-the-art performance on small-sample tasks compared to the latest stream learning methods and robust migration learning methods.

Essay: A projection module based on the shape space theory for small-sample image processing

Code: https://github.com/hg18855467337-del/PMSS

Read
2025年03月

Our team published the paper “Deep Learning-Based Framework for Efficient Design of Multicomponent High Hardness High Entropy Alloys” in the journal ACS Applied Materials & Interfaces (IF: 8.3, CAS Region II). The School of Computer Engineering and Science of Shanghai University is the first department of the paper, with Yuexing Han as the first author, Hui Wang as the second author, and Yi Liu as the corresponding author.

In the field of materials science, high-entropy alloys (HEAs) have become a research hotspot due to their excellent properties. However, finding an optimal design that is both innovative and reliable in the vast alloy composition space poses a huge challenge. Traditional trial-and-error methods are inefficient, while purely data-driven approaches struggle to ensure the practical performance of designs. To address this issue, we propose a deep learning-based framework that combines domain knowledge in materials science with data-driven techniques to optimize the design process of multicomponent high-hardness high-entropy alloys.

First, we developed a Materials Cascade Embedding (MCE) module and integrated it with the BiLSTM-CRF network (MCE-BILSTM-CRF) to automatically analyze 2,698 papers published in the past 5 years and extracted 8,067 data points. By incorporating materials domain knowledge into the data analysis, we identified high-potential elements and critical process conditions to guide the design and construction of machine learning datasets. After manually summarizing and organizing the target literature, we constructed a hardness dataset containing 13 elements. Based on this, we utilize a two-stage design strategy combining genetic algorithm (GA) and particle swarm optimization (PSO) to develop multi-component high-entropy alloys. The first stage explores the alloy system and the second stage optimizes the component ratios, thereby promoting innovation and performance enhancement. Our analysis combines SHAP feature significance and Pearson correlation coefficients (PCCs), complemented by materials domain knowledge, to validate the findings and guide the selection of alloy systems. In the end, we successfully designed three high-entropy alloys that differed from the existing dataset and predicted an average relative hardness error of less than 5%, with the optimal alloy being only 38 HV lower than the historical record.

Essay: Deep Learning-Based Framework for Efficient Design of Multicomponent High Hardness High Entropy Alloys

Read
2025年01月

Our team published the paper “Fast and Accurate Recognition of Perovskite Fluorescent Anti-Counterfeiting Labels Based on Lightweight Convolutional Neural Networks” in the international journal ACS Applied Materials & Interfaces (IF:8.3, CAS Region II). Counterfeiting Labels Based on Lightweight Convolutional Neural Networks”. The School of Computer Engineering and Science of Shanghai University is the first department of the paper, with Yuexing Han as the first author, Bao Shengqi as the second author, Bozhi Shi as the third author, and Qiaochuan Chen as the corresponding author.

Anti-counterfeiting technology has always been a key issue in the field of information security. Physical unclonable function (PUF) labels, which are random patterns generated by a stochastic process, are an effective anti-counterfeiting strategy due to the inherent randomness of their physical patterns. In this study, a high-throughput droplet array generation technique based on surface tension constraints was developed for the preparation of chalcogenide crystal films with controllable shapes and sizes. The texture of PUF labels is constructed by utilizing the random distribution of chalcogenide nanocrystal grains. Compared with other anti-counterfeiting labels, the labels in this study not only have fluorescent properties, but also have micrometer size, low cost, and high encoding capacity, which provide support for multilevel anti-counterfeiting protection. In addition, this study introduces an innovative PUF recognition method based on Partial Convolutional Network (PaCoNet), which effectively addresses the limitations of previous methods in terms of recognition accuracy and speed. Experimental validation of a dataset of chalcocite nanocrystal films containing up to 60 different macro shapes and unique micro textures shows that the method in this study achieves recognition accuracy of up to 99.65% and significantly reduces the recognition time per image to only 0.177 seconds, highlighting the potential application of these tags in the field of anti-counterfeiting.

Essay: Fast and Accurate Recognition of Perovskite Fluorescent Anti-Counterfeiting Labels Based on Lightweight Convolutional Neural Networks

Read

Our team published the paper “Statistics and Analysis of Lath Martensite Transformation based on in-situ observation and video processing”. The School of Computer Engineering and Science, Shanghai University was the first department, Yuexing Han was the first author, Ruiqi Li was the second author, and Xiangyu Xu was the corresponding author.

Research methods in materials science are undergoing a new transformation with the development of artificial intelligence and scientific equipment. The shift from the traditional way of studying material properties based on static images to revealing the process of material microstructural changes through dynamic videos has not only enhanced the depth of research, but also dramatically improved the efficiency of data processing. Especially in the field of steel manufacturing, the study of the phase transition from austenite to martensite is crucial for optimizing material properties.

In this paper, we propose a dynamic video-based method for analyzing the phase transformation of slat martensite, which breaks through the limitations of static image research. The method can efficiently segment and extract the image data of individual slat martensite, and analyze its changing law in dynamic video. By counting several key attributes including the number, size, area and direction of the transformed slats, we realize a comprehensive analysis of the dynamic characteristics of martensite phase transformation. This method not only improves the information extraction efficiency, but also provides important data support for revealing the martensite phase transformation mechanism and optimizing the steel manufacturing process.

The results show that dynamic video studies can significantly improve the efficiency and accuracy of data processing, especially when facing the complex morphology and rapid transformation process of slat martensite. In the future, I would like to apply this method to the study of more material systems to further promote the optimization of material properties and process improvement.

Read

Our team published a paper “Automatic pipeline for information of curve graphs in papers based on deep learning” in the international journal 《International Journal of Machine Learning and Cybernetics》 (IF : 3.1, CAS Region III). The School of Computer Engineering and Science of Shanghai University was the first department, Yuexing Han is the first author, Jinhua Xia is the second author, and Qiaochuan Chen is the corresponding author.

In materials science and biomedical fields. Current academic database tools mainly focus on mining textual information, ignoring the valuable information presented in graphs and charts. Extracting information from a large amount of literature can help researchers quickly grasp the current state of development. Literature is a carrier of many forms of data, and most researchers focus only on textual content. Especially, like graphs, they contain a lot of key numerical information that is not expressed in other data. In this paper, we propose a method for extracting information from graphs in the literature. With this method, the numerical values and axis entities of curved graphs can be extracted from both graphs and text. Firstly, the curved graphs are cut out from the literature using Yolov5s. Then, the exact title text corresponding to each curve graph is matched by manipulating Sentence-Bert. After obtaining the title text, the X-axis and Y-axis names in the curve graphs were extracted using SCI-Bert. At the same time, techniques such as Optical Character Recognition (OCR) were used to automatically parse the numerical data reflected on the graphs. In addition, a number of principles are used to improve performance. We validated each step using a dataset of 6042 articles from Elsevier. With our method, the accuracy of graph detection and title matching are 96.4% and 95.8% respectively, which are both better than the initial model, proving the effectiveness of our method. The accuracy of entity and numerical data extraction is 76.3% and 28.2%, respectively. The experimental results show that our method is able to achieve large-scale knowledge extraction of curve diagrams from the literature.

Essay: Automatic pipeline for information of curve graphs in papers based on deep learning

Read
Last updated: 2025-09-04
Visits: 加载中...