Export WordEmbeddingsTransform to ONNX #1249

vaeksare · 2018-10-12T23:54:46Z

Implements the ability to export the WordEmbeddingsTransform by converting it to an ONNX model, as well as expanding the functionality of some existing structures to allow for more efficient conversion implementation. The detailed conversion strategy can be found in comments inline. Fixes #1248

Testing was done through running the model on the same input using ML.NET and Lotus runtime directly using python bindings, producing the same results. The verified ONNX model (run in Lotus to check results) saved in Json format is used as a baseline for the formal tests.

The resulting ONNX model looks as follows:

Using placeholders for ONNX initiliazer nodes

dnfclas · 2018-10-12T23:54:57Z

All CLA requirements met.

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

wschin · 2018-10-17T20:00:46Z

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

+                //                      |                                |                                |
+                //                    J[j]                             K[j]                             L[j]
+                //                      |                                |                                |
+                //                       --------------------Concat (axis = 1) ---------------------------


``suggestion
// '-------------------Concat (axis = 1) ---------------------------'

Not fixed yet? #Resolved

Fixed now.

In reply to: 226109842 [](ancestors = 226109842)

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

justinormont

Looks good.

@vaeksare : Would it be worth writing a unit test which checks output equality between the WordEmbedding transform using ML.NET & its ONNX export? This will ensure the ONNX export stays in sync w/ the behavior of the transform in the future.

Currently, I think you are testing that the ONNX output is the expected value, but I'm concerned that when we do checkins which change the test values, we simply blindly update the expected test outputs without checking for equality between the ML.NET model & its ONNX conversion.

Possible cases (same type you manually checked): #1249 (comment)

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

wschin · 2018-10-24T23:33:45Z

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

+                //     /                                /      |         |                       \                  \
+                //    |                     Cast (to = int64)  | Cast (to = float)              Not                  |
+                //    |                            |           |         |                        |                  |
+                //    '------------ Add -----------'           | Scale (scale = 2.0)         Cast (to = int32)       |


Missing variable names between operators! Operators are never connected directly. There must be some variables between them. Please also make sure those names are consistent to your comments for your code. Thanks. #Resolved

Thanks, should be good now!

In reply to: 227994103 [](ancestors = 227994103)

vaeksare · 2018-10-24T23:35:05Z

@justinormont This was something I considered and talked to other people about initially. The issue that occurs is that our OnnxTransform (the only real way we have to run ONNX models directly) currently only works on Windows. As such, these tests would have to be made to only runs on Windows builds, which kind of defeats the purpose and means we still need the original ones. When the ability to run ONNX models is added to all platforms, this is likely something we should look into. But at the moment, I don't believe investing time into this is worth it.

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

…inelearning into savewordembeddings

wschin

Thanks. It looks great.

* add sweepable api example * fix tests * add grid search tuner * use grid search

vaeksare added 11 commits September 25, 2018 16:09

Prelimiary WordEmbeddingsTransform export to ONNX

d6171c9

Using placeholders for ONNX initiliazer nodes

More descriptive ONNX node names

df4b477

Merge remote-tracking branch 'upstream/master' into savewordembeddings

5975317

Fixed merge conflict

2665174

Merge remote-tracking branch 'upstream/master' into savewordembeddings

667ef35

Update to new signatures

ea54b11

Fix spacing

2fcad4d

Fix bugs with opset

f57ade8

Fix all bugs (works on Lotus)

fef06e2

Fixed comment spacing

626a448

Move case statements back

0267258

vaeksare self-assigned this Oct 12, 2018

vaeksare requested review from codemzs, Ivanidzo4ka and wschin October 12, 2018 23:54

vaeksare changed the title ~~Export WordEmbeddingsTransform to ONNX~~ WIP: Export WordEmbeddingsTransform to ONNX Oct 15, 2018

vaeksare added 4 commits October 16, 2018 10:27

Temp tests

cd68e90

Added tests

5343d97

Fixed tests

3198623

Merge remote-tracking branch 'upstream/master' into savewordembeddings

0d36fc7

vaeksare changed the title ~~WIP: Export WordEmbeddingsTransform to ONNX~~ Export WordEmbeddingsTransform to ONNX Oct 17, 2018

wschin reviewed Oct 17, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 17, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 17, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 17, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 17, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 17, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 17, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

Changed version back to fix tests

9074584

justinormont approved these changes Oct 24, 2018

View reviewed changes

wschin reviewed Oct 24, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Show resolved Hide resolved

wschin reviewed Oct 24, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Show resolved Hide resolved

wschin reviewed Oct 24, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Show resolved Hide resolved

wschin reviewed Oct 24, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Show resolved Hide resolved

wschin and others added 3 commits October 24, 2018 16:53

Update src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

f8dd2bc

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

Update src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

02e89d3

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

Update src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

ab486f6

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

wschin reviewed Oct 24, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 25, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 25, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 25, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin reviewed Oct 25, 2018

View reviewed changes

src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs Outdated Show resolved Hide resolved

wschin and others added 8 commits October 24, 2018 18:22

Update src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

b831a8e

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

Update src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

288c1d9

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

Update src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

92ed778

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

Update src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

f7c8a45

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

Update src/Microsoft.ML.Transforms/Text/WordEmbeddingsTransform.cs

1cae7ce

Co-Authored-By: vaeksare <vaeksare@microsoft.com>

Updated comment graph to include var names

2547354

Merge branch 'savewordembeddings' of https://github.com/vaeksare/mach…

cafdf1e

…inelearning into savewordembeddings

Make comments consistent

af1ee31

wschin approved these changes Oct 25, 2018

View reviewed changes

vaeksare merged commit 0cdde0f into dotnet:master Oct 25, 2018

vaeksare deleted the savewordembeddings branch October 25, 2018 17:23

LittleLittleCloud added a commit to LittleLittleCloud/machinelearning that referenced this pull request Jan 26, 2022

add sweepable api example (dotnet#1249)

e08cb85

* add sweepable api example * fix tests * add grid search tuner * use grid search

LittleLittleCloud added a commit that referenced this pull request Jan 28, 2022

add sweepable api example (#1249)

1c01078

* add sweepable api example * fix tests * add grid search tuner * use grid search

ghost locked as resolved and limited conversation to collaborators Mar 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Export WordEmbeddingsTransform to ONNX #1249

Export WordEmbeddingsTransform to ONNX #1249

vaeksare commented Oct 12, 2018 •

edited

Loading

dnfclas commented Oct 12, 2018 •

edited

Loading

wschin Oct 17, 2018 •

edited by vaeksare

Loading

wschin Oct 17, 2018 •

edited by vaeksare

Loading

vaeksare Oct 17, 2018

justinormont left a comment

wschin Oct 24, 2018 •

edited by vaeksare

Loading

vaeksare Oct 25, 2018

vaeksare commented Oct 24, 2018

wschin left a comment

Export WordEmbeddingsTransform to ONNX #1249

Export WordEmbeddingsTransform to ONNX #1249

Conversation

vaeksare commented Oct 12, 2018 • edited Loading

dnfclas commented Oct 12, 2018 • edited Loading

wschin Oct 17, 2018 • edited by vaeksare Loading

Choose a reason for hiding this comment

wschin Oct 17, 2018 • edited by vaeksare Loading

Choose a reason for hiding this comment

vaeksare Oct 17, 2018

Choose a reason for hiding this comment

justinormont left a comment

Choose a reason for hiding this comment

wschin Oct 24, 2018 • edited by vaeksare Loading

Choose a reason for hiding this comment

vaeksare Oct 25, 2018

Choose a reason for hiding this comment

vaeksare commented Oct 24, 2018

wschin left a comment

Choose a reason for hiding this comment

vaeksare commented Oct 12, 2018 •

edited

Loading

dnfclas commented Oct 12, 2018 •

edited

Loading

wschin Oct 17, 2018 •

edited by vaeksare

Loading

wschin Oct 17, 2018 •

edited by vaeksare

Loading

wschin Oct 24, 2018 •

edited by vaeksare

Loading