Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Dependency Analysis Action and Dockerfile #1095

Closed

Conversation

naveensrinivasan
Copy link
Member

New scorecard action #1070

  • Add workflow to publish dependency analysis Docker image
  • Add a new filter function to filter slices
  • Add a GetScorecardChecks function to get scorecard checks
  • Add a GetScore function to get score of a repo
  • Add a Validate function to validate token, owner, repo, commitSHA, and PR
  • Add a new action file for OSSF Scorecard dependency analysis

ossf-tests/vulpy#2 - python
ossf-tests/vulpy#1 - GitHub action
naveensrinivasan#23 - golang

Example of the GitHub action https://github.com/ossf-tests/vulpy/blob/master/.github/workflows/scorecard-dependency.yml

@codecov
Copy link

codecov bot commented Feb 23, 2023

Codecov Report

Merging #1095 (c20c23d) into main (570a953) will decrease coverage by 23.30%.
The diff coverage is 17.66%.

Additional details and impacted files

Impacted file tree graph

@@             Coverage Diff             @@
##             main    #1095       +/-   ##
===========================================
- Coverage   62.94%   39.65%   -23.30%     
===========================================
  Files           4        5        +1     
  Lines         251      517      +266     
===========================================
+ Hits          158      205       +47     
- Misses         77      291      +214     
- Partials       16       21        +5     
Impacted Files Coverage Δ
dependency-analysis/main.go 17.66% <17.66%> (ø)

 New scorecard action ossf#1070

- Add workflow to publish dependency analysis Docker image
- Add a new filter function to filter slices
- Add a GetScorecardChecks function to get scorecard checks
- Add a GetScore function to get score of a repo
- Add a Validate function to validate token, owner, repo, commitSHA, and PR
- Add a new action file for OSSF Scorecard dependency analysis
- Add structs for ScorecardResult, Check, DependencyDiff, and V

Signed-off-by: naveensrinivasan <172697+naveensrinivasan@users.noreply.github.com>
@naveensrinivasan naveensrinivasan force-pushed the naveen/feat/dependency-analysis branch 3 times, most recently from 32cfd28 to b8b6508 Compare February 24, 2023 18:47
- Update Apache License in `main_test.go`
- Add `paralleltest` and `govet` comments
- Change `_` to `//nolint:paralleltest` and `//nolint:govet`
- Change `0644` to `0o644`
- Remove test for invalid owner
- Remove owner from `Validate` function
- Remove a line from the `Vulnerability` struct

[dependency-analysis/main_test.go]
- Add Apache License to the file
- Add `paralleltest` and `govet` comments
- Change `_` to `//nolint:paralleltest` and `//nolint:govet`
- Change `0644` to `0o644`
- Remove test for invalid owner
- Remove owner from `Validate` function
[dependency-analysis/types.go]
- Remove a line from the `Vulnerability` struct

Signed-off-by: naveensrinivasan <172697+naveensrinivasan@users.noreply.github.com>
@naveensrinivasan naveensrinivasan force-pushed the naveen/feat/dependency-analysis branch from b8b6508 to 6998dbe Compare February 24, 2023 18:50
[dependency-analysis/README.md]
- Enable the GitHub Dependency Graph API
- Add a configuration input to specify the checks to run
- Add a comment to the pull request with the results of the analysis
- Update README with installation instructions

Signed-off-by: naveensrinivasan <172697+naveensrinivasan@users.noreply.github.com>
dependency-analysis/README.md Outdated Show resolved Hide resolved
dependency-analysis/action.yaml Show resolved Hide resolved
dependency-analysis/README.md Outdated Show resolved Hide resolved
dependency-analysis/main.go Outdated Show resolved Hide resolved
dependency-analysis/main.go Outdated Show resolved Hide resolved
dependency-analysis/main.go Show resolved Hide resolved
dependency-analysis/main.go Outdated Show resolved Hide resolved
dependency-analysis/main.go Show resolved Hide resolved
dependency-analysis/main.go Outdated Show resolved Hide resolved
Copy link

@raghavkaul raghavkaul left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for working on this, some suggestions for code cleanup.

Dockerfile-dependency-analysis Outdated Show resolved Hide resolved
dependency-analysis/README.md Show resolved Hide resolved
dependency-analysis/README.md Outdated Show resolved Hide resolved
dependency-analysis/main.go Show resolved Hide resolved
dependency-analysis/main.go Outdated Show resolved Hide resolved
dependency-analysis/main.go Show resolved Hide resolved
dependency-analysis/main.go Show resolved Hide resolved
dependency-analysis/main.go Show resolved Hide resolved
dependency-analysis/main.go Outdated Show resolved Hide resolved
// GetScorecardChecks returns the list of checks to run.
// This uses the SCORECARD_CHECKS environment variable to get the path to the checks list.
func GetScorecardChecks() ([]string, error) {
fileName := os.Getenv("SCORECARD_CHECKS")

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agree that it should be parameterized for testability; also wonder if filename is the right approach? If the user is configuring the DependencyDiff action, can their string environment variable be the comma-separated list of checks instead of the name of a file with the list of checks?

@naveensrinivasan naveensrinivasan force-pushed the naveen/feat/dependency-analysis branch 3 times, most recently from 5f5499b to 0d4382c Compare February 27, 2023 19:04
- Update Dockerfile for dependency analysis
- Rename GetScore function to GetScorecardResult
- Update README title and link to GitHub Dependency Review documentation
- Add environment variable for list of checks
- Add checks for valid repo
- Filter out dependencies that are not added
- Filter out dependencies that do

[Dockerfile-dependency-analysis]
- Rename file from `Dockerfile-dependency-analysis` to `Dependency-analysis.dockerfile`
[dependency-analysis/main_test.go]
- Change the name of the GetScore function to GetScorecardResult
- Lower the minimum score required in the test from `got.Score < tt.score` to `got.Score <= tt.score`
[dependency-analysis/README.md]
- Change the title of the README from `OpenSSF Dependency Analysis` to `OpenSSF Scorecard Dependency Analysis`
- Change the link to the GitHub Dependency Review documentation
- Change the action name to `ossf/scorecard-action/dependency-analysis@main`
[dependency-analysis/main.go]
- Convert the PR number to an integer
- Move the `octokit` initialization to a separate file
- Add an environment variable to get the list of checks
- Add a check for a valid repo
- Convert the PR number to an integer
- Add a function to get the HTML for vulnerabilities
- Add a function to get the scorecard result
- Filter out dependencies that are not added
- Filter out dependencies that do
[.github/workflows/publish-dependency-image.yml]
- Change the file name for the Dockerfile from `Dockerfile-dependency-analysis` to `Dependency-analysis.dockerfile`

Signed-off-by: naveensrinivasan <172697+naveensrinivasan@users.noreply.github.com>
@naveensrinivasan naveensrinivasan force-pushed the naveen/feat/dependency-analysis branch from 0d4382c to c20c23d Compare February 27, 2023 19:13
Copy link
Contributor

@azeemshaikh38 azeemshaikh38 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Question/comment on this design: #1070 (comment)

Copy link

@raghavkaul raghavkaul left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, LGTM.

@naveensrinivasan
Copy link
Member Author

Question/comment on this design: #1070 (comment)

We are continuing the discussion on the ☝️ thread

@naveensrinivasan
Copy link
Member Author

Question/comment on this design: #1070 (comment)

A friendly ping

packages: write

steps:
- name: Checkout repository
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

## Usage
The OpenSSF Dependency Analysis is a GitHub Action that can be easily incorporated into a workflow.
The workflow can be triggered on a pull request event.
The action will run on the latest commit on the default branch of the repository, and will create a comment on the pull request with the results of the analysis.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe clarify what this means. Default branch of which repo?

The OpenSSF Dependency Analysis is a GitHub Action that can be easily incorporated into a workflow.
The workflow can be triggered on a pull request event.
The action will run on the latest commit on the default branch of the repository, and will create a comment on the pull request with the results of the analysis.
An example of the comment can be found [here](https://github.com/ossf-tests/vulpy/pull/2#issuecomment-1442310469).
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are we showing the aggregate score in the summary for each result? Do users need to click to see it?
Are we creating a new comment for each run?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are we showing the aggregate score in the summary for each result

We aren't showing aggregate score. Do you think we should show the aggregate score?

Are we creating a new comment for each run?

We will not create results for reach run. Only if the user manually runs it only then we create another comment.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

so we run it once for the first commit, and then only if the user asks to run?
Maybe add to the comment description to explain to users that they can re-run to update?

An example of the comment can be found [here](https://github.com/ossf-tests/vulpy/pull/2#issuecomment-1442310469).

## Prerequisites
The actions require enabling the [GitHub Dependency Review](https://docs.github.com/en/code-security/supply-chain-security/understanding-your-software-supply-chain/about-dependency-review) for the repository.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this really true? For go projects, unlikely. For other projects, is it? We don't need an exact dependency tag, so it should not be needed. Does GitHub API return dependency name / repo without enabling this feature?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this really true? For go projects, unlikely. For other projects, is it? We don't need an exact dependency tag, so it should not be needed. Does GitHub API return dependency name / repo without enabling this feature?

I have tested for go projects and here is python package https://github.com/ossf-tests/vulpy/pull/2/files. For teh GH API to return results it needs this feature to be tuned on.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Very interesting... and unexpected to me

### Configuration
The action can be configured using the following inputs:

- `SCORECARD_CHECKS`: This environment variable takes a file containing a list of checks to run.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need JSON format? I could imagine a single Action input that takes a comma-separated list check1, check2, check3. Action inputs are the standard way to pass parameters into Action, no?

Do you anticipate more complicated structure needed in the future?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can give users the option to exclude certain repositories or checks for a particular repo, which we can make possible with the file option.

This will also give us the ability to extend the feature in the future. That's the idea.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should have a config file in this case then. Standard way to pass parameters to an Action is not env variables. It's inputs.

log.Fatal(err)
}

ts := oauth2.StaticTokenSource(&oauth2.Token{AccessToken: token})
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OAuth tokens is required to access the BQtable? Can you add a comment why it's needed?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is GH Client access.

ts := oauth2.StaticTokenSource(&oauth2.Token{AccessToken: token})
	tc := oauth2.NewClient(context.Background(), ts)
	client := github.NewClient(tc)

I think from the code we should be able to make it being used in GH Client API. Unless you think it is still necessary.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

auth token must bee needed to use the GITHUB_TOKEN. That makes sense now :-)

url := strings.TrimPrefix(k, "https://")
scorecard, err := GetScorecardResult(url)
if err != nil {
if len(i.Vulnerabilities) > 0 {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why are we treating Vulns as a special case?

I worry our data is inaccurate, ie we're using a dependency HEAD whereas a user is installing a particular package version. There are other tools that will do a better job at this than Scorecard, like dependabot.

I'm thinking we may even remove the vuln results entirely because of how poor our data may be. Otherwise users will be confused and unhappy that we're claiming there are vulns in a package if there are none.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK. I agree. I can remove the Vulns.

if checks.Repo.Name == "" {
return ""
}
sb := strings.Builder{}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe create a template file rather than hardcoded this layout in code. It' pretty hard to read otherwise

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree. I can do that.

if fileName == "" {
// default to critical and high severity checks
return []string{
"Dangerous-Workflow",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we instead be returning just the aggregate score and link to deps.dev for better UX? Or at least provide a link to deps.dev for users to see more results?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can add the aggregate score but not every repo has the deps.dev and that is the reason not to include deps.dev.

t.Errorf("GetScorecardChecks() error = %v, wantErr %v", err, tt.wantErr)
return
}
if !reflect.DeepEqual(got, tt.want) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: use go-cmp package for deep comparison

naveensrinivasan added a commit to ossf/scorecard that referenced this pull request May 23, 2023
- Remove depedencydiff as we are building something similar to this ossf/scorecard-action#1095

Signed-off-by: naveensrinivasan <172697+naveensrinivasan@users.noreply.github.com>
@spencerschrock
Copy link
Member

Closing as this was implemented in a separate repo instead.
https://github.com/ossf/scorecard-dependencyanalysis

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants