Skip to content

A github action that synchronises a Github repo data, including Backstage definition file information, with a Notion database

License

Notifications You must be signed in to change notification settings

infinitaslearning/notion-github-catalog

Use this GitHub action with your project
Add this Action to an existing workflow or create a new one
View on Marketplace

Repository files navigation

Notion Github Catalog

This action will scan all of the Github repositories available to the provided token and update their information in the specified Notion database.

TL;DR ... you love the idea of https://backstage.io/ but think its way too complex to manage and operate, so just use Notion instead :)

Notion integration and token

First, you need to have an integration access token - which you can get from https://www.notion.so/my-integrations after creating an integration. Give the integration a friendly name like 'Github Actions'.

By default integrations cant access any content so you you must share your database (or the parent page / tree it is contained within) with the integration you created earlier to be able to access it.

Notion Databases

This action expects a Notion database with the following properties, this will become the

  • Name: text
  • Description: text
  • Kind: select
  • Lifecycle: select
  • URL: url
  • Owner: select|relation
  • System: select|relation
  • DependsOn: relation (to self - new field - sync tasks)
  • Tags: multi_select
  • Visibility: select
  • Language: select
  • Status: select
  • Updated: date
  • Hash: text

The following notion page and database is connected to the tests by default, and can be used as a template: https://infinitaslearning.notion.site/Notion-Github-Catalogue-ac2395eda37144e698e6b8faef1003c7

It looks like this after it has run:

Screenshot 2021-12-19 at 12 55 39

Embedded Data

If your descriptor file contains links, these are added to an embedded database within the service page called Links.

Used information

The following fields are used to populate the database:

Name: from the descriptor file, if empty then the repository name Description: from the descriptor file, if empty then the repository description, if empty then the repository name Kind: from the descriptor file Lifecycle: from the descriptor file URL: from the descriptor file, if empty then part of the descriptor file URL, if empty then the repository URL Owner: from the descriptor file System: from the descriptor file DependsOn: from the descriptor file Tags: from the descriptor file, which are enriched with the Language information from GitHub, the connected teams in GitHub and for C# the version information from the csproj files. Visibility: from the repository visibility Language: from the repository (main language) Status: from the descriptor file Updated: current date/time

Relation Databases

You have an option to provide additional 'lookup' databases to convert some of the above selects into relations:

Owner, System

  1. Create a database that has at least a Name column that is unique, all other columns are up to you.
  2. Create an 'Unknown' row - this is what services that cannot be mapped go to (this name matters).
  3. In the config of the action, provide the input variable owner_database pointing to this database.
  4. In the main service catalogue table convert the Owner column to a relation, pointing at the above database.

This can be repeated for System. The full config is below in the usage.

Service Descriptor Format

This action expects each of your repositories to have a descriptor file format in the root of the repo in the form of a Backstage catalog-info file. This is because we are testing this approach against using Backstage directly, and wanted to leverage a format that perhaps has a chance of becoming a defacto standard. It does not currently map all fields from Backstage, but if you look at the code you can see what it does map. I may add a config option to allow mapping to be more dynamic in future (a good PR!).

Information on the format: https://backstage.io/docs/features/software-catalog/descriptor-format

The following types are supported:

Component

This is a basic component, with the following fields supported:

apiVersion: backstage.io/v1alpha1
kind: Component
metadata:
  name: app-ecommerce-web
  description: The front end application for noordhoff.nl, plantyn.com and liber.se
  links:
    - url: https://portal.azure.com/#@infinitaslearning.onmicrosoft.com/resource/subscriptions/4ebd66c4-aaad-4b1b-bb4e-740db9f1fc4d/resourceGroups/ecommerce-pro-rg/overview
      title: Azure
      icon: dashboard
    - url: https://liber-shop.production.infinicloud.app/
      title: Production
  tags:
    - ecommerce
    - application
    - react
spec:
  type: application
  lifecycle: production
  owner: commercial
  system: ecommerce
  dependsOn:
    - service-cms-api 

Fields other than those above are currently not supported, but could be with a PR.

API

This can be used, though we do not support the spec attribute, as it is impossible to render things like OpenAPI inside Notion, so these should be linked to via the links, but if you create a file exactly as Component above, it will be processed with the different Kind property (so be filterable).

Location

This should be used in the case of a mono-repo, that contains multile sub-components. Place the following in the root. Note that in the current implementation it needs to be a relative path to the root of the repository, and do not start paths with ./. No information from the Location file itself is added to the catalog, only the linked targets.

apiVersion: backstage.io/v1alpha1
kind: Location
spec:
  type: url
  targets:
    - sub-component-one/catalog-info.yaml
    - sub-component-two/catalog-info.yaml

Usage

This is typically deployed as a scheduled action:

name: Catalog
on:
  schedule:
    - cron:  '30 5 * * *'
  workflow_dispatch:
jobs:
  catalog:
    runs-on: ubuntu-latest
    steps:
     - name: Notion github catalog     
       uses: infinitaslearning/notion-github-catalog@main        
       with:          
         github_owner: infinitaslearning
         github_token: ${{ secrets.PAT_GITHUB_TOKEN }}
         notion_token: ${{ secrets.NOTION_TOKEN }}
         database: 2b26b4290cc84d95ad3e93c3255277a1    
         owner_database: 7943615f4dba43b3a3b0f991f4f7136d
         system_database: 121534684fe840a1953500e603c2b602
         repository_type: all
         repository_filter: .*
         push_missing: true
         catalog_file: catalog-info.yaml

To get the database ID, simply browse to it, click on the '...' in Notion, and get a 'Copy link'. The GUID at the end of the URL (but before the ?v=) is the id, this works on both embedded and full page databases.

Development

Assumes you have @vercel/ncc installed globally. After changes ensure you npm run build, commit and then submit a PR.

For the tests to run you need to have the environment variables set for GITHUB_TOKEN, NOTION_TOKEN and NOTION_DATABASE.

About

A github action that synchronises a Github repo data, including Backstage definition file information, with a Notion database

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •