Is it really byte-level?

From your paper it seems like the byte-level classification decomposes a character i.e. 'C' into its binary representation, something like 000101110, but your code gives back 68, which I think it's not what you intended, cause that is simply a char level representation.

Am I wrong?

Your dataset would be still fulfilling its purpose of using very long sequences, but I think it's not char-byte-level, but char-level.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is it really byte-level? #61

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development