Developing a component

Smithy offers an SDK to allow users to easily develop custom components for the platform and share them with other users. As of the time of this writing, the SDK supports the following component types:

targets
scanners
filters
enrichers
reporters

Component Types

Targets are components that fetch artifacts from an API, source code host, etc. and these artifacts are the ones that will be scanned by the next layer of components

Scanners are the components that use some binary, API or custom logic to discover vulnerabilities in an artifact and will then parse the results into Vulnerability Finding objects of the OCSF standard

Filters get a list of all the Vulnerability Findings discovered within the context of the workflow execution and hide some of them from the rest of the components

Enrichers add more context and information to the Vulnerability Findings

Reporters report the Vulnerability Findings to some external system, such as a database, a messenger, etc.

Component Execution

Components execute in a specific order that can't be modified and is the following:

Targets
Scanners
Filters
Enrichers
Reporters

Components can be added in any order in a workflow specification, however they will always be ordered to match the order specified above. The filters are the only components that are optional, for each one of the other types of components you need to have at least on in the workflow specification in order for it to be considered valid.

Component Types and SDK

The SDK models how each type of components interacts with the Vulnerability Findings.

Targets don't have any interaction, they just fetch the artifacts so that they can become accessible by the scanners.

Scanners produce Vulnerability Findings and the SDK ensures that they will be stored in the database. They don't modify any existing vulnerabilities.

Filters don't create new Vulnerability Findings, they can hide them but they can't delete them. A Vulnerability Finding once produced by a scanner, will remain in the database, it can be hidden, annotated or contextualised but not deleted.

Enrichers enrich Vulnerability Findings that have not been hidden by the filters. They operate only on data that have been produced in the context of the current workflow execution, they don't have access to historical data.

Reporters get a list of enriched Vulnerability Findings, they can't modify them in any way but they can report them to an external system for further processing.

Building a component's images

If you want to build a component's images and share the manifest with other users you can do so using the following commands:

smithyctl --debug-enabled component build --sdk-version v1.0.0 \
                                          --tag v0.0.9 \
                                          --registry-url <some-registry> \
                                          components/scanners/<your-component-name>/component.yaml

smithyctl --debug-enabled component package --sdk-version v1.0.0 \
                                            --version v0.0.9 \
                                            --registry-url <some-registry> \
                                            components/scanners/<your-component-name>/component.yaml

The first command will build any images required by the component and the second will resolve all the image references in the component and will then push the rendered component manifest to the registry.

A component's image must be an abstract reference that matches the path of the component in the repository. For example, the scanner named sql-injection whose manifest resides in the path components/scanners/sql-injection/component.yaml could have the following image references:

ubuntu:24.04
components/scanners/sql-injection
components/scanners/sql-injection/helper

The first image is a third-party image, it will be resolved to index.docker.io/library/ubuntu:24.04 and will be fetched before the component starts its execution. The second and third images are component images, meaning that smithyctl is able to build them for you in a consistent way. At the moment, the Smithy SDK is only available in Go and hence we have a standard way of building our images using a base Dockerfile and changing the context and a couple of build arguments.

Since each component image is practically a path in the repo, we expect all images to be Go modules located in the respective path. The image build will use the path as the context path. You can set one or more tags and the registry host to be added to the resulting image reference. So, the command:

smithyctl --debug-enabled component build --sdk-version v1.0.0 \
                                          --tag v0.0.9 \
                                          --tag latest \
                                          --tag current-staging \
                                          --registry-url some-registry.com \
                                          --namespace smithy \
                                          --base-component-dockerfile components/Dockerfile \
                                          components/scanners/sql-injection/component.yaml

will result in 2 images being built with the following tags:

some-registry.com/smithy/components/scanners/sql-injection:v0.0.9
some-registry.com/smithy/components/scanners/sql-injection:latest
some-registry.com/smithy/components/scanners/sql-injection:current-staging
some-registry.com/smithy/components/scanners/sql-injection/helper:v0.0.9
some-registry.com/smithy/components/scanners/sql-injection/helper:latest
some-registry.com/smithy/components/scanners/sql-injection/helper:current-staging

The Docker command that would yield the same result would like this:

docker build -t some-registry.com/smithy/components/scanners/sql-inection:v0.0.9 -f components/Dockerfile --build-arg ... components/scanners/sql-injection/
docker build -t some-registry.com/smithy/components/scanners/sql-inection:v0.0.9 -f components/Dockerfile --build-arg ... components/scanners/sql-injection/helper/

You can get a report of what would the smithyctl component build resolve and built by using the --dry-run flag to get a machine readable report of all the actions in the stdout. An example for the current version of the gosec component is the following:

./bin/smithyctl/cmd/linux/amd64/smithyctl component build --dry-run \
                                                          --tag v0.0.9 \
                                                          --tag latest \
                                                          --tag current-staging \
                                                          --sdk-version v1.0.0 \
                                                          new-components/scanners/gosec/component.yaml 
...
custom_images:
    - tags:
        - ghcr.io/smithy-security/smithy/images/new-components/scanners/gosec:v0.0.9
        - ghcr.io/smithy-security/smithy/images/new-components/scanners/gosec:latest
        - ghcr.io/smithy-security/smithy/images/new-components/scanners/gosec:current-staging
      labels:
        org.opencontainers.image.source: https://github.com/smithy-security/smithy
      build_args:
        COMPONENT_PATH: new-components/scanners/gosec
        SDK_VERSION: v1.0.0
      context_path: new-components/scanners/gosec
      dockerfile: new-components/Dockerfile
      component_path: new-components/scanners/gosec
      platform: linux/amd64
external_images:
    docker.io/securego/gosec:2.15.0: {}

A component manifest package can be built independently of an image. The packaging command renders all component images to have the full image URLs for all the steps. Contrary to the image building command, the packaging command doesn't allow for component images to have multiple tags, because only one can be rendered in the component.

Suppose that our sql-injection component has the following manifest:

name: sql-injection
type: scanner
steps:
  - name: scanner
    image: ubuntu:24.04
    executable: /bin/app
  - name: parser
    image: components/scanners/sql-injection
    executable: /bin/app
  - name: parser
    image: components/scanners/sql-injection/helper
    executable: /bin/app

The command

./bin/smithyctl/cmd/linux/amd64/smithyctl component package --version v0.0.9 \
                                                            --sdk-version v1.0.0 \
                                                            --registry-url some-registry.com \
                                                            --namespace smithy \
                                                            new-components/scanners/sql-injection/component.yaml

will render it into the following manifest before packaging into OCI and pushing it:

name: sql-injection
type: scanner
steps:
  - name: scanner
    image: index.docker.io/library/ubuntu:24.04
    executable: /bin/app
  - name: parser
    image: some-registry.com/smithy/components/scanners/sql-injection:v0.0.9
    executable: /bin/app
  - name: parser
    image: some-registry.com/smithy/components/scanners/sql-injection/helper:v0.0.9
    executable: /bin/app

If you wish to apply some rendering on the packaged manifest with go templates for example, you could run the following command:

./bin/smithyctl/cmd/linux/amd64/smithyctl component package --version '{{ some-version.parameter }}' \
                                                            --sdk-version v1.0.0 \
                                                            --registry-url some-registry.com \
                                                            --namespace smithy \
                                                            new-components/scanners/sql-injection/component.yaml

and you will get the following result:

name: sql-injection
type: scanner
steps:
  - name: scanner
    image: index.docker.io/library/ubuntu:24.04
    executable: /bin/app
  - name: parser
    image: some-registry.com/smithy/components/scanners/sql-injection:{{ some-version.parameter }}
    executable: /bin/app
  - name: parser
    image: some-registry.com/smithy/components/scanners/sql-injection/helper:{{ some-version.parameter }}
    executable: /bin/app

OCSF Field Mappings for Scanners

OCSF is a very expressive standard. As such it has a variety of fields, subfields and various object that can be expressed. It is important to define strict rules and maintain discipline on mappings so that each component treats information the same way. This is so that the datalake produced by Smithy remains useful instead of becoming the data swamp of doom, a place where data goes to rot since it requires significant manual effort to understand.

Rules for mapping:

Glossary:

A finding is a single instance of a potential vulnerability from any security tool. For example for SAST, a finding is an ocurrnce of a single CWE in a specific file and line.

Rules:

Each scanner should report one instance of each vulnerability as a an instance of a "VulnerabilityFinding" object with a single vulnerability in it. This is because at the current OCSF version, an individual Vulnerability object cannot be enriched.
The title of each finding is: details.'findingInfo'.'title'
Titles should uniquely identify a type of finding and be as understandable to humans as possible. For exaaple: Trivy's SARIF output, sets the title for license related finding to "License", this is obviously a bad title. A good title would have been: "Potentially Dangerous License: GPL2.0". This isn't a problem only with Trivy, several tools report overly short info.
Each Scanner should report the name of the tool it parses information for as part of each finding in the field "details.findingInfo.productUid".
If the corresponding tool reports Confidence, each Scanner should propagate the reporting tool's confidence in the field 'details.confidenceId' and 'details.confidence'.
The field 'details.confidence' should be a textual representation of 'details.confidenceID'
If the corresponding tool reports Severity, each Scanner should propagate the reporting tool's severity in the field 'details.severityID' and 'details.severity'.
The field 'details.severity' should be a textual representation of 'details.severityID'
The field 'details.message' should contain a concise and human friendly description of exactly what the problem is with ideally, an explanation of the impact and maybe remediation advice. For example: GoSec has descriptions such as

"Implicit memory aliasing in for loop."

this means very little without extensive extra enrichment. A good description is the following:

golang.org/x/text/language in golang.org/x/text before 0.3.7 can panic with an out-of-bounds read during BCP 47 language tag parsing. Index calculation is mishandled. If parsing untrusted user input, this can be used as a vector for a denial-of-service attack."
A scanner should mandatorily set a Datasource to a supported TargetType depending on if the target being scanned is a Repository, Dependency, OCIPackage or Website.
A scanner should try to propagate as much information from the source as possible. For example if the source tool reports fixes in a machine-understandable way, the scanners should convert those fixes to the relevant OCSF fields.
If the source tool reports common identifiers such as CVE ids, CWEs, CREs or any other Common Identifiers, the scanner should make every effort to propagate this information

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Developing a component

Component Types

Component Execution

Component Types and SDK

Building a component's images

OCSF Field Mappings for Scanners

Rules for mapping:

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Developing a component

Component Types

Component Execution

Component Types and SDK

Building a component's images

OCSF Field Mappings for Scanners

Rules for mapping: