Simplify processing ASCII text files that have line ending character #295

yruslan · 2020-05-25T07:52:14Z

Background

Currently, all parameters must be specified in this code snippet:

    val parsedCopybook = CopybookParser.parseTree(ASCII(), copybook, dropGroupFillers = false, segmentRedefines = Seq(), stringTrimmingPolicy = StringTrimmingPolicy.TrimNone, ebcdicCodePage = CodePage.getCodePageByName("common"), nonTerminals = Seq())
    val cobolSchema = new CobolSchema(parsedCopybook, SchemaRetentionPolicy.CollapseRoot, false)
    val sparkSchema = cobolSchema.getSparkSchema

Feature

Simplify loading ASCII files by creating a method that has default options for all parameters.
Add direct support for such kind of files.
Update the documentation.

The text was updated successfully, but these errors were encountered:

Spark's support for text files is very limited for our use case since it does not support encodings and custom line endings. We need to re-implement this feature using a variable-length reader.

yruslan added the enhancement New feature or request label May 25, 2020

yruslan self-assigned this May 25, 2020

yruslan changed the title ~~Simplify copybook parsing for ASCII encoded data~~ Simplify processing ASCII text files that have line ending character May 25, 2020

yruslan added a commit that referenced this issue May 27, 2020

#295 Add a parsing method that has default values for all fields.

cb3996a

yruslan added a commit that referenced this issue May 27, 2020

#295 Add documentation for the new way of processing ASCII text files.

a586792

yruslan mentioned this issue May 27, 2020

#295 Simplify loading ASCII text files #296

Merged

yruslan closed this as completed in #296 May 28, 2020

yruslan added a commit that referenced this issue May 28, 2020

#295 Add a parsing method that has default values for all fields.

0480152

yruslan added a commit that referenced this issue May 28, 2020

#295 Add documentation for the new way of processing ASCII text files.

0d9a4f8

yruslan reopened this May 28, 2020

yruslan closed this as completed Jun 12, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify processing ASCII text files that have line ending character #295

Simplify processing ASCII text files that have line ending character #295

yruslan commented May 25, 2020 •

edited

Loading

Simplify processing ASCII text files that have line ending character #295

Simplify processing ASCII text files that have line ending character #295

Comments

yruslan commented May 25, 2020 • edited Loading

Background

Feature

yruslan commented May 25, 2020 •

edited

Loading