Tags · gsavarela/ilurl

0.3.8

Upgrade to flow-0.5.0.dev & remove examples(#15)

Feb 1, 2020
8a37044
zip
tar.gz

0.3.7

Update 0x01/README.latex plus additions (#20)

Jan 30, 2020
211146d
zip
tar.gz

0.3.6

Delete manually edited osm and net files (#17)

Jan 20, 2020
b97f9dd
zip
tar.gz

0.3.5

Reduce train, evaluate and re-train cycle (#14)

ISSUE:
------

Reduce the hyper parameter search cycle

FIX:
---

The longest task during training was the generation of the emissions
file which always grows do the decimal of the GBs and took a long time
to generate. The shift in scripts was always allow the info json (for
aggregate performance metrics) and the pickle (for recovering the
training models) to be generated by default. While enabling emissions'
file generation only on evaluation.

Jan 16, 2020
4cabe8b
zip
tar.gz

0.3.4

Update envs' reward and default distrib. (#14)

Jan 14, 2020
0e371fb
zip
tar.gz

0.3.3

Tune speed and count categories, fix tests (#14)

Jan 14, 2020
947a3b6
zip
tar.gz

0.3.2

Refactor baseline scripts into generic ilu (#13)

Jan 11, 2020
ca8d72a
zip
tar.gz

0.3.1

Stabilize baseline intersection scenario (#13)

Jan 10, 2020
f5a9d44
zip
tar.gz

0.3.0

Add flow and queue features to TLSAgent

	STATES:
	-------
	States were described by discretization of two variables
mean speeds and mean number of the vehicles trafficking over the
intersection scope (both incoming and outgoing sections).
 	Now two new variables have been added the mean throughput
(flow) and queue as it appears such variables are more common in
traffic light signal control literature. The flow is computed
only on the outgoing sections (everything that's exiting must have
entered at some point, while queues are computed only over incoming
section edges (outgoing edges make the incoming edges of neighbours)
	In addition to two new variables the variables are now
composable i.e before only mean speeds and mean number were allowed
now we may run experiments with either flow or queue but with both
together.

	ACTIONS:
	--------
	The choice criteria which only counted with epsilon greedy
now was expanded to incorporate the upper bound confidence interval.
see section 2.7 of [1]

    	REFERENCES
    	----------
    	[1] Sutton et Barto, Reinforcement Learning 2nd Ed 2018

Sep 20, 2019
319c81b
zip
tar.gz

0.2.7

Show statistics plot dynamically while training

Sep 11, 2019
c4b2232
zip
tar.gz

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0.3.8

0.3.7

0.3.6

0.3.5

0.3.4

0.3.3

0.3.2

0.3.1

0.3.0

0.2.7

Tags: gsavarela/ilurl