Sequence Modeling Benchmarks and Temporal Convolutional Networks (TCN)

This repository is a fork of Temporal Convolutional Networks, which implements the methods and experiments of "An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling" by Shaojie Bai, J. Zico Kolter, and Vladlen Koltun:

@article{BaiTCN2018,
	author    = {Shaojie Bai and J. Zico Kolter and Vladlen Koltun},
	title     = {An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling},
	journal   = {arXiv:1803.01271},
	year      = {2018},
}

Quadrant Problem Experiment

To help my own understanding, I created this fork and added an experiment built around the following sequence prediction problem:

Suppose there are $n$ coordinates $(x_1,y_1)$, $(x_2, y_2)$, ..., $(x_n, y_n)$, each mapped to a character according to the quadrant of the plane in which it lies, giving a sequence of $n$ characters:

A | C    A | C         A | C
-----  , ----- , ... , -----
B | ?    B | ?         B | ?

To make perfect prediction impossible, the 4th quadrant (lower right) is marked "?". A coordinate that falls there is assigned A, B, or C with equal probability.

For example:

$\lbrace (x_1,y_1), (x_2,y_2), (x_3,y_3) \rbrace = \lbrace (-1,-2), (-4,5), (3,3) \rbrace \rightarrow$ BAC

$\lbrace (x_1,y_1), (x_2,y_2), (x_3,y_3) \rbrace = \lbrace (-1,-2), (-4,5), (3,-3) \rbrace \rightarrow$ BA[one of {A,B,C}]
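
The labeling rule above can be sketched in a few lines of Python. This is only an illustration of the problem statement, not the code in quadrant_test.py: the function names quadrant_label and label_sequence are hypothetical, and raising an error for points that lie exactly on an axis is an assumption, since the description does not cover that case.

```python
import random


def quadrant_label(x, y, rng=random):
    """Return the character for one coordinate, following the diagram above."""
    if x < 0 and y > 0:
        return "A"  # upper left
    if x < 0 and y < 0:
        return "B"  # lower left
    if x > 0 and y > 0:
        return "C"  # upper right
    if x > 0 and y < 0:
        # Lower right ("?"): A, B, or C with equal probability.
        return rng.choice("ABC")
    # Assumption: points on an axis are outside the problem statement.
    raise ValueError("coordinate lies on an axis")


def label_sequence(points, rng=random):
    """Map a list of (x, y) pairs to the corresponding character string."""
    return "".join(quadrant_label(x, y, rng) for x, y in points)


print(label_sequence([(-1, -2), (-4, 5), (3, 3)]))   # BAC
print(label_sequence([(-1, -2), (-4, 5), (3, -3)]))  # BA followed by a random A, B, or C
```

Passing a seeded random.Random instance as rng makes the "?" draws reproducible.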

Setup

Create and activate a Python 3.8 virtual environment using pyenv (the virtualenv subcommand is provided by the pyenv-virtualenv plugin):

pyenv install -v 3.8.14
pyenv virtualenv 3.8.14 tcn-3.8.14
pyenv activate tcn-3.8.14

Install requirements via Poetry:

poetry install

Usage

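Run the quadrant experiment to train the TCN on these sequences. In the sample output, the training and test losses both fall over the ten epochs shown, so the model does learn to predict the sequences: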

poetry run python quadrant_test.py

Sample Output:

Train Epoch:  1 [   198/   800 (25%)]   Learning rate: 0.0040   Loss: 1.030636
Train Epoch:  1 [   398/   800 (50%)]   Learning rate: 0.0040   Loss: 0.832494
Train Epoch:  1 [   598/   800 (75%)]   Learning rate: 0.0040   Loss: 0.692009
Train Epoch:  1 [   798/   800 (100%)]  Learning rate: 0.0040   Loss: 0.701636

Test set: Average loss: 0.609160

Train Epoch:  2 [   198/   800 (25%)]   Learning rate: 0.0040   Loss: 0.615945
Train Epoch:  2 [   398/   800 (50%)]   Learning rate: 0.0040   Loss: 0.573940
Train Epoch:  2 [   598/   800 (75%)]   Learning rate: 0.0040   Loss: 0.496575
Train Epoch:  2 [   798/   800 (100%)]  Learning rate: 0.0040   Loss: 0.528395

Test set: Average loss: 0.533496

...

Train Epoch: 10 [   198/   800 (25%)]   Learning rate: 0.0040   Loss: 0.311580
Train Epoch: 10 [   398/   800 (50%)]   Learning rate: 0.0040   Loss: 0.314201
Train Epoch: 10 [   598/   800 (75%)]   Learning rate: 0.0040   Loss: 0.255762
Train Epoch: 10 [   798/   800 (100%)]  Learning rate: 0.0040   Loss: 0.355394

Test set: Average loss: 0.392151
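
When reading the losses above, it may help to estimate the lowest loss the "?" quadrant allows. The sketch below computes that floor under assumptions this README does not state: the reported loss is a mean per-character cross-entropy in nats, the model can identify each coordinate's quadrant, and all four quadrants are equally likely. The function name irreducible_loss is hypothetical and is not part of the repository.

```python
import math


def irreducible_loss(p_ambiguous=0.25, n_choices=3):
    """Expected cross-entropy (nats) of an ideal predictor.

    Assumption: the three deterministic quadrants contribute zero loss,
    while the ambiguous quadrant contributes ln(n_choices) each time it
    occurs, because the best possible prediction there is uniform.
    """
    return p_ambiguous * math.log(n_choices)


print(round(irreducible_loss(), 4))  # 0.2747
```

If those assumptions hold, an ideal predictor would plateau near 0.27, and individual batch losses can land on either side of that value by chance.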
