Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts


  • Time-MoE (Model): the first work to scale time series foundation models up to 2.4 billion parameters (trained from scratch)

  • Time-300B (Dataset): the largest open-access time series data collection, comprising over 300 billion time points spanning more than 9 domains


Updates/News:

🚩 News (Oct 2024): Time-MoE introduction in Chinese is now available!

🚩 News (Oct 2024): Time-300B dataset is now available on 🤗 Hugging Face!

🚩 News (Oct 2024): Time-MoE (large) is now available on 🤗 Hugging Face!

🚩 News (Sept 2024): Time-MoE (base) is now available on 🤗 Hugging Face!

🚩 News (Sept 2024): Time-MoE preprint has been made available on arXiv!

Introduction

Time-MoE comprises a family of decoder-only time series foundation models with a mixture-of-experts architecture, designed to operate in an auto-regressive manner, enabling universal forecasting with arbitrary prediction horizons and context lengths of up to 4096.
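
For example, when a series is longer than the 4096-point maximum context, only the most recent 4096 points need to be passed to the model. The minimal sketch below illustrates this truncation; it reuses the Maple728/TimeMoE-50M checkpoint and the normalize/generate/inverse-normalize steps shown under "Making Forecasts" below, and the series length and horizon here are arbitrary choices for illustration.

import torch
from transformers import AutoModelForCausalLM

MAX_CONTEXT = 4096  # maximum context length supported by Time-MoE

long_series = torch.randn(1, 10000)      # [batch_size, series_length], longer than the max context
context = long_series[:, -MAX_CONTEXT:]  # keep only the most recent 4096 points

model = AutoModelForCausalLM.from_pretrained(
    'Maple728/TimeMoE-50M', device_map="cpu", trust_remote_code=True
)

# normalize, forecast, and map predictions back to the original scale (same steps as below)
mean, std = context.mean(dim=-1, keepdim=True), context.std(dim=-1, keepdim=True)
output = model.generate((context - mean) / std, max_new_tokens=96)  # [1, 4096 + 96]
forecast = output[:, -96:] * std + mean                             # [1, 96]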

Time-MoE Model Card

Model              Activated Params.   Total Params.
Time-MoE (base)    50M                 113M
Time-MoE (large)   200M                453M
Time-MoE (ultra)   1.1B                2.4B

📚 Training Data

Time-300B dataset is available on 🤗 Hugging Face!

Here's an example of how to use this dataset:

import random
from time_moe.datasets.time_moe_dataset import TimeMoEDataset

ds = TimeMoEDataset('Time-300B')          # load the dataset from the local 'Time-300B' folder
seq_idx = random.randint(0, len(ds) - 1)  # pick a random sequence index
seq = ds[seq_idx]                         # retrieve that sequence

This snippet loads a random sequence from the Time-300B dataset: first download the dataset into a local 'Time-300B' folder, then import the TimeMoEDataset class from time_moe.datasets.time_moe_dataset, instantiate it, and retrieve a sequence by random index.
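
For illustration, here is a minimal sketch of turning such a sequence into a (context, target) pair by hand. The window lengths and the conversion of the sequence to a plain 1-D float array are our own choices for this example, not part of the dataset API.

import numpy as np
from time_moe.datasets.time_moe_dataset import TimeMoEDataset

ds = TimeMoEDataset('Time-300B')
seq = np.asarray(ds[0], dtype=np.float32)  # first sequence, as a plain float array

context_length, prediction_length = 512, 96
if len(seq) >= context_length + prediction_length:
    window = seq[: context_length + prediction_length]
    context = window[:context_length]   # model input
    target = window[context_length:]    # ground truth to forecast
    print(context.shape, target.shape)  # (512,) (96,)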

🚀 Getting Started

Installation

  1. Install Python 3.10+, then install the dependencies:
pip install -r requirements.txt

Time-MoE requires transformers==4.40.1.

  2. [Optional but recommended] Install flash-attn for faster training and inference with reduced memory usage:
pip install flash-attn==2.6.3

Making Forecasts

import torch
from transformers import AutoModelForCausalLM

context_length = 12
seqs = torch.randn(2, context_length)  # tensor shape is [batch_size, context_length]

model = AutoModelForCausalLM.from_pretrained(
    'Maple728/TimeMoE-50M',
    device_map="cpu",  # use "cpu" for CPU inference, and "cuda" for GPU inference.
    trust_remote_code=True,
)

# use the following line instead when flash-attn is available
# model = AutoModelForCausalLM.from_pretrained('Maple728/TimeMoE-50M', device_map="auto", attn_implementation='flash_attention_2', trust_remote_code=True)

# normalize seqs
mean, std = seqs.mean(dim=-1, keepdim=True), seqs.std(dim=-1, keepdim=True)
normed_seqs = (seqs - mean) / std

# forecast
prediction_length = 6
output = model.generate(normed_seqs, max_new_tokens=prediction_length)  # shape is [batch_size, 12 + 6]
normed_predictions = output[:, -prediction_length:]  # shape is [batch_size, 6]

# inverse normalize
predictions = normed_predictions * std + mean
  • If the sequences are already normalized:
import torch
from transformers import AutoModelForCausalLM

context_length = 12
normed_seqs = torch.randn(2, context_length)  # tensor shape is [batch_size, context_length]

model = AutoModelForCausalLM.from_pretrained(
    'Maple728/TimeMoE-50M',
    device_map="cpu",  # use "cpu" for CPU inference, and "cuda" for GPU inference.
    trust_remote_code=True,
)

# use the following line instead when flash-attn is available
# model = AutoModelForCausalLM.from_pretrained('Maple728/TimeMoE-50M', device_map="auto", attn_implementation='flash_attention_2', trust_remote_code=True)

# forecast
prediction_length = 6
output = model.generate(normed_seqs, max_new_tokens=prediction_length)  # shape is [batch_size, 12 + 6]
normed_predictions = output[:, -prediction_length:]  # shape is [batch_size, 6]
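
The two snippets above can be folded into a single helper. The sketch below is a convenience wrapper added here for illustration (the forecast function is our own, not part of the repository); it performs the same normalize, generate, and inverse-normalize steps as the first snippet.

import torch
from transformers import AutoModelForCausalLM

def forecast(model, seqs, prediction_length):
    # normalize per sequence, forecast, then map predictions back to the original scale
    mean = seqs.mean(dim=-1, keepdim=True)
    std = seqs.std(dim=-1, keepdim=True)
    output = model.generate((seqs - mean) / std, max_new_tokens=prediction_length)
    return output[:, -prediction_length:] * std + mean

model = AutoModelForCausalLM.from_pretrained(
    'Maple728/TimeMoE-50M', device_map="cpu", trust_remote_code=True
)
seqs = torch.randn(2, 12)                                  # [batch_size, context_length]
predictions = forecast(model, seqs, prediction_length=6)   # [batch_size, 6]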

Evaluation

  • Prepare the benchmark datasets.

You can download the pre-processed datasets from [Google Drive] and place the contents under ./dataset.

  • [Example] Run the following command to evaluate on ETTh1.
python run_eval.py -d dataset/ETT-small/ETTh1.csv -p 96
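
For a quick sanity check outside run_eval.py, MSE and MAE can also be computed by hand. The sketch below is only illustrative: it assumes the standard ETTh1.csv layout with an 'OT' target column and evaluates a single window at the end of the file, so it will not reproduce the script's official split or reported numbers.

import pandas as pd
import torch
from transformers import AutoModelForCausalLM

context_length, prediction_length = 512, 96

# load the target column and take one context/target window from the end of the series
df = pd.read_csv('dataset/ETT-small/ETTh1.csv')
series = torch.tensor(df['OT'].values, dtype=torch.float32)
context = series[-(context_length + prediction_length):-prediction_length].unsqueeze(0)
target = series[-prediction_length:].unsqueeze(0)

model = AutoModelForCausalLM.from_pretrained(
    'Maple728/TimeMoE-50M', device_map="cpu", trust_remote_code=True
)

# normalize, forecast, and map predictions back to the original scale
mean, std = context.mean(dim=-1, keepdim=True), context.std(dim=-1, keepdim=True)
output = model.generate((context - mean) / std, max_new_tokens=prediction_length)
pred = output[:, -prediction_length:] * std + mean

print('MSE:', torch.mean((pred - target) ** 2).item())
print('MAE:', torch.mean(torch.abs(pred - target)).item())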

Citation

🙋 Please let us know if you find a mistake or have any suggestions!

🌟 If you find the Time-MoE models helpful in your research, please consider starring this repository and citing the corresponding paper:

@misc{shi2024timemoe,
      title={Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts}, 
      author={Xiaoming Shi and Shiyu Wang and Yuqi Nie and Dianqi Li and Zhou Ye and Qingsong Wen and Ming Jin},
      year={2024},
      eprint={2409.16040},
      archivePrefix={arXiv},
      url={https://arxiv.org/abs/2409.16040}, 
}

Related Resources

  • Foundation Models for Time Series Analysis: A Tutorial and Survey, in KDD 2024. [paper] [Tutorial]
  • What Can Large Language Models Tell Us about Time Series Analysis, in ICML 2024. [paper]
  • Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects, in TPAMI 2024. [paper] [Website]
  • A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection, in TPAMI 2024. [paper] [Website]
  • Transformers in Time Series: A Survey, in IJCAI 2023. [paper] [GitHub Repo]

Acknowledgement

We appreciate the following GitHub repos a lot for their valuable code and efforts.

  • Time-LLM [repo]
  • TimeMixer [repo]
  • Time-Series-Library [repo]
  • Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series and Spatio-Temporal Data [repo]

License

This project is licensed under the Apache-2.0 License.
