Copybook.parseSimple not dropping the fillers in output ast #324

ghost · 2020-09-17T14:04:50Z

String copybookContents = "01 RECORD.
05 FILLER PIC X(1).
05 COMPANY_PREFIX PIC X(3).
05 FILLER PIC X(1).
05 FILLER PIC X(1).
05 COMPANY_NAME PIC X(9)."

Group grp = CopybookParser.parseSimple(copybookContents, true,true,commentPolicy.apply(false,6,72)).ast();

in the above scenario as drop_value_fillers is true then the output ast also should not contain the FILLERS. But output ast is providing each column details.

yruslan · 2020-09-22T13:24:30Z

Thanks for the bug report. Dropping of fillers is a feature of 'spark-cobol'. Fillers should be deopped in Spark schema while still can be present in the AST.
But your expectation makes sense. We can drop fillers from AST as well.

yruslan · 2022-02-02T09:19:09Z

Hi, I was revisiting this issue.
Each AST element has isFiller flag, so you can always ignore all fields if the flag is true.

Is it necessary for your use case to actually remove fillers from the AST if you have this flag?

ghost · 2022-02-02T10:05:07Z

That's how we are handling in our code while parsing the ast. It's a good to have feature to drive via parsesimple

yruslan · 2022-02-02T10:07:55Z

Could you please elaborate on what are you trying to achieve?

Every use case that I can think of can be done by just ignoring fields (AST statements) for which isFiller == true.

ghost · 2022-02-02T11:03:33Z

We are currently parsing the copybook using ast and converting to json. We are using parsesimple to get the ast. if based on the user options if we can drop or retain the filler in ast then for us its not required to handle while converting back to json.

yruslan · 2022-02-02T11:51:40Z

Sure, but using node.isFiller can be used to achieve the same, isn't it?

cobrix/cobol-parser/src/main/scala/za/co/absa/cobrix/cobol/parser/ast/Statement.scala

Line 81 in 3014923

def isFiller: Boolean

What's the code snippet that you are using?

yruslan · 2022-02-02T11:56:40Z

I see why you are asking for the feature. There are parameters that say 'dropGroupFiller' and 'dropValueFillers' but nothing actually dropped, just marked. It makes sense. We can implement the dropping as well

ghost · 2022-02-02T14:53:26Z

You are correct. Currently we are parsing and dropping the fillers by utilizing the isFiller. currently the parameter we are sending for dropping filler to parseSimple is not functional.

yruslan · 2022-02-03T08:38:41Z

This is implemented.

Please, check the latest master. Note that the signature of the method (parseSimple) has changed. It now reflects what is actually being done.

Spark schemas doe not support having 2 column names having tha same name so FILLERs need to be renamed in order to be retained. So the method allow controlling if fillers are going to be renamed, and a separate flag that controls if non-renamed fields should be dropped.

ghost · 2022-02-03T11:00:43Z

@yruslan If signature is changed then it will break our existing code if we update the version, we need to make it backward compatible. Can we have it overloaded.

yruslan · 2022-02-03T13:21:32Z

Okay, fair point. Will make it compatible

ghost · 2022-02-03T13:27:34Z

@yruslan thanks a lot. you can mark the existing parsesimple signature to @deprecated

yruslan · 2022-02-03T13:44:22Z

The changes are in master - you can check. To retain the compatibility the behavior is the same by default.
Use 'dropFillersFromAst = true' to actually drop fillers from the AST.

ghost · 2022-02-04T03:46:33Z

So as in parseSimple is still have a new parameter dropFillersFromAst in the signature so once we upgrade we have to provide the value in our calling code. In scala it won't ask as the default value is given for it as false but in Java we have to explicitly specify to false while calling.

yruslan · 2022-02-04T07:02:48Z

Yeah, but this is the only way to preserve backward compatible behavior. Since the method has default values it cannot be overloaded.

yruslan · 2022-02-04T10:02:43Z

2.4.8 is released and it has the fix.

Btw, Cobrix has converters project that currently just provides examples on how to convert mainframe files to JSON without Spark. Consider contributing your converter to the project :)

ghost · 2022-02-04T17:30:26Z

Sure I would be happy to contribute.
Can I know the overall requirement and detail plan for it. Is there any high level design document or still it is in process. I can contribute their also. Any plans to have the code base in Java or it will be completely in scala :)

yruslan · 2022-02-07T10:55:09Z

We don't have an exact plan at the moment. Just ideas. The one that we might likely to implement is a command-line tool + library that allows converting mainframe files to JSON.

Yes, the project will continue to be in Scala.

ghost · 2022-02-07T12:03:16Z

Sure then I think of 2 requirements

Converting copybook ast to corresponding json
Converting copybook to a map with key as parent hierarchy and value as column name.

ghost added the bug Something isn't working label Sep 17, 2020

yruslan added the accepted Accepted for implementation label Sep 22, 2020

yruslan added a commit that referenced this issue Feb 3, 2022

#324 Drop fillers from AST if requested when invoking 'parseSimple()'

882eaec

yruslan added a commit that referenced this issue Feb 3, 2022

#324 Update README.md

4b860db

yruslan added a commit that referenced this issue Feb 3, 2022

#324 Drop fillers from AST if requested when invoking 'parseSimple()'

f2e9775

yruslan added a commit that referenced this issue Feb 3, 2022

#324 Update README.md

6d5a4cc

yruslan added a commit that referenced this issue Feb 3, 2022

#324 Make parseSimple() signature backwards compatible.

2ecd0fe

yruslan added a commit that referenced this issue Feb 3, 2022

#324 Make parseSimple() signature backwards compatible.

298e2f4

yruslan closed this as completed Feb 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Copybook.parseSimple not dropping the fillers in output ast #324

Copybook.parseSimple not dropping the fillers in output ast #324

ghost commented Sep 17, 2020

yruslan commented Sep 22, 2020

yruslan commented Feb 2, 2022

ghost commented Feb 2, 2022

yruslan commented Feb 2, 2022

ghost commented Feb 2, 2022

yruslan commented Feb 2, 2022

yruslan commented Feb 2, 2022

ghost commented Feb 2, 2022 •

edited by ghost

Loading

yruslan commented Feb 3, 2022

ghost commented Feb 3, 2022 •

edited by ghost

Loading

yruslan commented Feb 3, 2022

ghost commented Feb 3, 2022 •

edited by ghost

Loading

yruslan commented Feb 3, 2022

ghost commented Feb 4, 2022

yruslan commented Feb 4, 2022

yruslan commented Feb 4, 2022

ghost commented Feb 4, 2022 •

edited by ghost

Loading

yruslan commented Feb 7, 2022

ghost commented Feb 7, 2022

Copybook.parseSimple not dropping the fillers in output ast #324

Copybook.parseSimple not dropping the fillers in output ast #324

Comments

ghost commented Sep 17, 2020

yruslan commented Sep 22, 2020

yruslan commented Feb 2, 2022

ghost commented Feb 2, 2022

yruslan commented Feb 2, 2022

ghost commented Feb 2, 2022

yruslan commented Feb 2, 2022

yruslan commented Feb 2, 2022

ghost commented Feb 2, 2022 • edited by ghost Loading

yruslan commented Feb 3, 2022

ghost commented Feb 3, 2022 • edited by ghost Loading

yruslan commented Feb 3, 2022

ghost commented Feb 3, 2022 • edited by ghost Loading

yruslan commented Feb 3, 2022

ghost commented Feb 4, 2022

yruslan commented Feb 4, 2022

yruslan commented Feb 4, 2022

ghost commented Feb 4, 2022 • edited by ghost Loading

yruslan commented Feb 7, 2022

ghost commented Feb 7, 2022

ghost commented Feb 2, 2022 •

edited by ghost

Loading

ghost commented Feb 3, 2022 •

edited by ghost

Loading

ghost commented Feb 3, 2022 •

edited by ghost

Loading

ghost commented Feb 4, 2022 •

edited by ghost

Loading