DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Local Applications
DispFormer is a transformer-based neural network for inverting S-wave velocity structure from surface-wave dispersion curves.
(1) Clone the repository:

```bash
git clone https://github.com/liufeng2317/DispFormer
cd DispFormer
```
(2) Install the requirements:

- Python >= 3.8
- PyTorch >= 1.9.0
- Other dependencies listed in `requirements.txt`:

```bash
pip install -r requirements.txt
```
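A quick sanity check before running the scripts can save debugging time. The snippet below is an illustrative sketch (not part of the DispFormer codebase) that verifies the interpreter and PyTorch versions meet the minimums listed above:

```python
import sys

def version_tuple(v):
    """Parse a dotted version string like '1.9.0' into a comparable tuple."""
    return tuple(int(p) for p in v.split("+")[0].split(".")[:3])

# The Python interpreter itself must be at least 3.8.
assert sys.version_info >= (3, 8), "DispFormer requires Python >= 3.8"

# Check PyTorch only if it is already installed (the import name is 'torch').
try:
    import torch
    if version_tuple(torch.__version__) < (1, 9, 0):
        print(f"PyTorch {torch.__version__} found, but >= 1.9.0 is required")
except ImportError:
    print("PyTorch not found; run `pip install -r requirements.txt`")
```

The `split("+")[0]` step strips local build tags such as `+cu118` that PyTorch appends to its version string.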
(3) File list

- `DispFormer/model/dispformer.py`: the DispFormer model architecture.
- `DispFormer/dataloader.py`: dataset handling based on `torch.utils.data.Dataset`.
- `DispFormer/dataloader_pretrain.py`: specialized datasets for pre-training.
- `DispFormer/data_augmentation.py`: dataset augmentation functions (e.g., Gaussian noise, masking).
- `DispFormer/plots.py`: functions for generating result visualizations.
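To make the augmentation step concrete, here is a minimal sketch of the two techniques named above (Gaussian noise and masking) applied to a toy dispersion curve. Function names and defaults are illustrative; the actual implementations live in `DispFormer/data_augmentation.py` and may differ:

```python
import numpy as np

def add_gaussian_noise(curve, sigma=0.01, rng=None):
    """Perturb velocities with zero-mean Gaussian noise (sigma in km/s)."""
    rng = rng or np.random.default_rng()
    return curve + rng.normal(0.0, sigma, size=curve.shape)

def random_mask(curve, mask_ratio=0.2, mask_value=0.0, rng=None):
    """Randomly mask a fraction of periods to mimic missing observations."""
    rng = rng or np.random.default_rng()
    masked = curve.copy()
    idx = rng.choice(curve.size, size=int(mask_ratio * curve.size), replace=False)
    masked[idx] = mask_value
    return masked

# Toy phase-velocity curve sampled at 1-100 s periods (LITHO1.0-like range).
periods = np.arange(1, 101, dtype=float)
curve = 3.0 + 0.01 * periods            # monotonically increasing toy velocities
noisy = add_gaussian_noise(curve, sigma=0.02, rng=np.random.default_rng(0))
masked = random_mask(curve, mask_ratio=0.2, rng=np.random.default_rng(0))
```

Masking with a sentinel value mirrors the gaps found in real dispersion picks, which is what lets the augmented model tolerate incomplete inputs.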
The LITHO1.0 global synthetic dataset is used for pre-training the model, while the Central and Western US Dataset (CWD) and the Continental China Dataset (CCD) are used to validate the effectiveness of the zero-shot and few-shot strategies. Finally, datasets retrieved from the China Seismological Reference Model (CSRM) are used to test the model on real data. All datasets can be downloaded from Zenodo: https://zenodo.org/records/14619577
| Dataset  | Samples | Period  | Max Depth | Tags             | Reference            |
|----------|---------|---------|-----------|------------------|----------------------|
| LITHO1.0 | 40,962  | 1-100 s | 200 km    | Global Synthetic | Masters et al., 2014 |
| CWD      | 6,803   | 10-60 s | 120 km    | Local Synthetic  | Shen et al., 2013    |
| CCD      | 4,527   | 5-80 s  | 200 km    | Local Synthetic  | Shen et al., 2016    |
| CSRM     | 12,705  | 8-70 s  | 120 km    | Local Real       | Xiao et al., 2024    |
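As the table shows, each dataset samples a different period band, so curves arrive with different lengths. One common way to batch such variable-length curves is to scatter each onto a shared period axis and carry a validity mask; the sketch below illustrates that idea only — the names and conventions are assumptions, not DispFormer's actual input format:

```python
import numpy as np

# Shared 1-100 s period grid covering the widest dataset (LITHO1.0).
COMMON_PERIODS = np.arange(1, 101, dtype=float)

def to_padded(periods, velocities, pad_value=0.0):
    """Scatter one curve onto the common grid; return (values, valid_mask)."""
    values = np.full(COMMON_PERIODS.size, pad_value)
    mask = np.zeros(COMMON_PERIODS.size, dtype=bool)
    idx = np.searchsorted(COMMON_PERIODS, periods)  # grid slots for each period
    values[idx] = velocities
    mask[idx] = True
    return values, mask

# A CWD-like curve observed only at 10-60 s.
p = np.arange(10, 61, dtype=float)
v = 3.2 + 0.005 * p
values, mask = to_padded(p, v)
```

The mask lets attention-based models ignore padded positions, which is why a transformer can handle the 10-60 s CWD band and the 1-100 s LITHO1.0 band with the same network.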
We pre-train the DispFormer model using the LITHO1.0 global synthetic dataset, which covers a wide variety of velocity structures. The pre-training process follows two approaches:

- Without data augmentation: training on the original LITHO1.0 dataset.
  - Training script: `01_1_LITHO1.py`
- With data augmentation: training on augmented datasets produced with techniques such as noise addition and masking.
  - Training script: `01_2_LITHO1_dataAug.py`
Once pre-trained, the model can be applied to datasets of varying lengths, including CWD and CCD datasets, to test its zero-shot performance.
DispFormer supports direct application of the pre-trained model to regional datasets (CWD & CCD). The results show that the zero-shot strategy yields errors closely centered around zero with few outliers, outperforming the interpolated reference model from LITHO1.0. However, given the domain gap between the training and testing datasets, zero-shot performance may not always surpass traditional optimization methods such as global search. We therefore propose using the zero-shot DispFormer as an initial-model generator to enhance traditional inversion algorithms.
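The "initial-model generator" idea can be sketched with a toy perturbation search: start a local refinement from the network's zero-shot prediction instead of from a generic reference model. Everything below is illustrative; a real workflow would use an actual dispersion forward solver and a proper global-search algorithm:

```python
import numpy as np

def refine(initial_model, misfit, n_iter=200, step=0.05, seed=0):
    """Accept-if-better perturbation search around an initial model."""
    rng = np.random.default_rng(seed)
    best, best_fit = initial_model.copy(), misfit(initial_model)
    for _ in range(n_iter):
        trial = best + rng.normal(0.0, step, size=best.shape)
        fit = misfit(trial)
        if fit < best_fit:          # keep the trial only if it lowers the misfit
            best, best_fit = trial, fit
    return best, best_fit

# Toy misfit: distance to a hidden "true" 1-D Vs model (km/s).
true_vs = np.array([3.0, 3.4, 3.8, 4.2])
misfit = lambda m: float(np.mean((m - true_vs) ** 2))
zero_shot_init = true_vs + 0.3          # stand-in for the network's prediction
refined, fit = refine(zero_shot_init, misfit)
```

Starting close to the truth shrinks the search volume, which is the practical benefit of seeding a traditional inversion with the zero-shot output.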
When limited labeled data are available for fine-tuning the pre-trained model, the results can surpass traditional inversion methods. The results from the CWD and CCD datasets show that even with just a few dozen labels, fine-tuning achieves performance comparable to global search algorithms. More details can be found in the following scripts:
- Fine-tuning with 10 CWD samples
- Fine-tuning with 36 CWD samples
- Fine-tuning with 108 CWD samples
- Fine-tuning with 180 CWD samples
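The few-shot recipe above amounts to taking a handful of gradient steps on a small labelled set while reusing what pre-training already learned. The toy numpy sketch below illustrates that pattern with a frozen feature map standing in for the pretrained encoder; it is not DispFormer's training code:

```python
import numpy as np

rng = np.random.default_rng(0)

def features(x):
    """Stand-in for the frozen pretrained encoder (fixed basis functions)."""
    return np.stack([x, x**2, np.ones_like(x)], axis=-1)

# A dozen labelled samples from the "target region".
x = rng.uniform(0.0, 1.0, size=12)
y = 1.5 * x + 0.3 * x**2 + 0.1          # unknown regional relation

# Fine-tune only the readout weights by gradient descent on the MSE.
w = np.zeros(3)
lr = 0.5
for _ in range(2000):
    pred = features(x) @ w
    grad = 2.0 * features(x).T @ (pred - y) / x.size
    w -= lr * grad

final_mse = float(np.mean((features(x) @ w - y) ** 2))
```

Because only the small readout is updated, a dozen labels suffice; this mirrors why a few dozen CWD samples already close most of the domain gap.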
Real-world data often face challenges such as varying period ranges, missing data, and low signal-to-noise ratios, which can hinder the direct application of CNN/FNN-based networks. DispFormer, however, can be applied to real data without modifying the network structure or aligning the observations to the training data. Inversion results show that zero-shot DispFormer provides acceptable results for most major structures. Moreover, after fine-tuning with a small amount of labeled data, the few-shot DispFormer produces results comparable to or better than the reference model, and in terms of data misfit it outperforms the reference model.
This project was developed by Feng Liu at Shanghai Jiao Tong University (SJTU). For any inquiries, please contact Liu Feng via email [email protected].
If you find DispFormer or its related publicly available datasets useful in your research, please cite the following papers:
Liu, F., Deng, B., Su, R., Bai, L., & Ouyang, W. (2025). DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications. arXiv preprint arXiv:2501.04366.
Liu, F. (2025). Surface Wave Dispersion Benchmark Datasets: Synthetic and Real-World Cases (1.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.14619577
- and/or the software:

```bibtex
@software{DispFormer_LiuFeng_2024,
  author  = {Feng Liu},
  title   = {DispFormer},
  month   = jan,
  year    = {2024},
  version = {v1.0.0},
  url     = {https://github.com/liufeng2317/DispFormer}
}
```