
Generating and Feeding a Custom Dataset into T-GCN #583

Open
MortezaBabazadehShareh opened this issue Feb 5, 2025 · 3 comments

@MortezaBabazadehShareh

Hello,

Thank you for this implementation of T-GCN! I am trying to generate my own random dataset and feed it into the TGCN network instead of the METRLA dataset (as in the traffic_prediction.jl example). However, I am running into an issue where the model's output is NaN, both before and after training.

Could you please provide guidance on how to properly format a custom dataset and ensure it works correctly with the model?

Specifically:

  • What structure should the custom dataset follow to be compatible with TGCN?
  • Are there any specific pre-processing steps I need to follow when creating the dataset?
  • Are there common pitfalls that could cause NaN values in the model output?

Any functions, advice, or example code for using a different dataset would be greatly appreciated.

Thanks in advance!

@CarloLucibello
Member

Here is an example with a fixed graph topology and time-varying node features:

julia> using GraphNeuralNetworks

julia> num_nodes, num_edges = 5, 10;

julia> d_in, d_out = 2, 3;

julia> timesteps = 5;

julia> g = rand_graph(num_nodes, num_edges);

julia> x = rand(Float32, d_in, timesteps, num_nodes);

julia> layer = TGCN(d_in => d_out)
GNNRecurrence(
  TGCNCell(2 => 3),                     # 126 parameters
)                   # Total: 18 arrays, 126 parameters, 1.469 KiB.

julia> y = layer(g, x);

julia> size(y) # (d_out, timesteps, num_nodes)
(3, 5, 5)

julia> all(isfinite, y)
true

Normally the layer would be part of a larger model, with some node embedding/projection at the beginning and a classification/regression head at the end.
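For instance, a minimal sketch of such a model, reusing g and x from above (the hidden width 16 and the relu activations are arbitrary choices for illustration):

julia> model = GNNChain(Dense(d_in => 16, relu),  # node embedding/projection
                        TGCN(16 => 16),
                        x -> relu.(x),
                        Dense(16 => d_out));      # regression head

julia> size(model(g, x))
(3, 5, 5)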

The fact that you observe NaNs at initialization is very weird. Can you provide an example?

@MortezaBabazadehShareh
Author

MortezaBabazadehShareh commented Feb 6, 2025

Actually, I want to generate a dataset with the same structure as the one loaded by the following code:

using MLDatasets: METRLA

dataset_metrla = METRLA(; num_timesteps = 3)
g = dataset_metrla[1]
features = g.node_data.features
targets = g.node_data.targets

In this case, features and targets are vectors with one entry per observation, where each entry is a 3-dimensional array:

features[i]  # (num_features, num_nodes, num_timesteps)
targets[i]   # (num_targets, num_nodes, num_timesteps)

For example, in the METRLA dataset:

num_features = 2
num_targets = 1
num_nodes = 207
num_timesteps = 3
num_observations = 34269
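
If the arrays are laid out as described, a quick shape check (using the numbers above) should give:

@assert length(features) == 34269
@assert size(features[1]) == (2, 207, 3)  # (num_features, num_nodes, num_timesteps)
@assert size(targets[1]) == (1, 207, 3)   # (num_targets, num_nodes, num_timesteps)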

To generate a random dataset with the same structure, I used the following function:

using GraphNeuralNetworks

function generate_dataset(num_nodes = 20, num_edges = 30, num_observations = 200,
                          num_timesteps = 3, num_features = 2, num_targets = 1)

    # Random node-level data, one array per observation
    features = [rand(Float32, num_features, num_nodes, num_timesteps) for _ in 1:num_observations]
    targets = [rand(Float32, num_targets, num_nodes, num_timesteps) for _ in 1:num_observations]

    # Generate a random graph and extract its edge list.
    # edge_index returns the source and target vectors directly,
    # avoiding indexing into the edge iterator.
    g = rand_graph(num_nodes, num_edges)
    s, t = edge_index(g)

    edge_data = rand(Float32, num_edges)  # assign random edge weights

    graph = GNNGraph((s, t), edata = edge_data, num_nodes = num_nodes)

    # 70/30 train/test split
    split = round(Int, num_observations * 0.7)
    train_loader = zip(features[1:split], targets[1:split])
    test_loader = zip(features[split+1:end], targets[split+1:end])

    return graph, train_loader, test_loader, features, targets
end
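
A usage sketch with the default arguments (all names as defined above):

graph, train_loader, test_loader, features, targets = generate_dataset()
for (x, y) in train_loader
    # each observation pairs features with targets for all nodes and timesteps
    @assert size(x) == (2, 20, 3)  # (num_features, num_nodes, num_timesteps)
    @assert size(y) == (1, 20, 3)  # (num_targets, num_nodes, num_timesteps)
end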

However, despite following this structure, I am still encountering NaN values in the model output. Could you confirm whether this dataset structure is correctly aligned with what T-GCN expects? Additionally, are there any specific pre-processing steps or normalizations required to avoid NaN values?

@CarloLucibello
Member

CarloLucibello commented Feb 7, 2025

Let me first say that in GNN.jl v1.0 the temporal convolutions have changed: the temporal dimension is now the second-to-last dimension.
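
Concretely, a small sketch of the new layout (the sizes here are only for illustration):

# In v1.0 a TGCN input sample has shape (num_features, num_timesteps, num_nodes),
# i.e. the temporal dimension is second to last.
x = rand(Float32, 2, 12, 207)          # 2 features, 12 timesteps, 207 nodes

# An array stored as (num_features, num_nodes, num_timesteps), like the
# METRLA-style arrays described above, needs its last two dimensions swapped:
x_old = rand(Float32, 2, 207, 12)
x_new = permutedims(x_old, (1, 3, 2))  # -> (2, 12, 207)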

The tutorials were updated only recently, so you have to look at the dev version of the documentation: https://juliagraphs.org/GraphNeuralNetworks.jl/docs/GraphNeuralNetworks.jl/dev/tutorials/traffic_prediction/

For the dataset, I would create a custom type. Also, I don't see NaNs in the following training code:

using GraphNeuralNetworks, Flux

struct Dataset{D}
    data::D
    length::Int
    num_timesteps::Int
end

function Dataset(; num_features, num_nodes, total_length, num_timesteps)
    data = rand(Float32, num_features, total_length, num_nodes)
    length = total_length - num_timesteps + 1
    return Dataset(data, length, num_timesteps)
end

Base.length(d::Dataset) = d.length

function Base.getindex(d::Dataset, i::Int)
    (1 <= i <= length(d)) || throw(ArgumentError("Index out of bounds."))
    x = d.data[:, i:i+d.num_timesteps-1, :]
    # the target is the first feature at next time step
    y = d.data[1, i+1:i+d.num_timesteps, :]
    return x, y
end

Base.getindex(d::Dataset, is::AbstractVector) = [d[i] for i in is]

num_timesteps = 2
num_nodes = 3
num_features = 2

d = Dataset(; num_features, num_nodes, total_length=1000, num_timesteps)
g = rand_graph(num_nodes, 6)

train_data = d[1:100]
val_data = d[101+num_timesteps:200]
train_loader = DataLoader(train_data, shuffle=true, batchsize=-1)
val_loader = DataLoader(val_data, batchsize=-1)

model = GNNChain(Dense(num_features => 64, relu),  # node feature projection
                 TGCN(64 => 64),
                 x -> relu.(x),
                 TGCN(64 => 64),
                 x -> relu.(x),
                 Dense(64 => 1),                    # regression head
                 flatten)                           # (1, timesteps, nodes) -> (timesteps, nodes)

opt_state = Flux.setup(Flux.AdamW(0.001), model)

for epoch in 1:10
    for (x, y) in train_loader
        # differentiate with respect to the closure argument m, not the global model
        grads = Flux.gradient(m -> Flux.mse(m(g, x), y), model)
        Flux.update!(opt_state, model, grads[1])
    end
end
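
To verify that no NaNs appear on held-out data, one could add a small evaluation step after training (a sketch reusing model, g, and val_loader from above):

using Statistics: mean

val_loss = mean(Flux.mse(model(g, x), y) for (x, y) in val_loader)
@assert isfinite(val_loss)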
