Skip to main content

Assemblyline 4 - Service base

Project description

Assemblyline 4 - Service Base

This repository provides the base service functionality for Assemblyline 4 services.

Creating a new Assemblyline service

Service file structure

An Assemblyline service has the following file structure:

assemblyline-service-<service name>
│
├── Dockerfile
├── <service name>.py
└── service_manifest.yml

This is overview of what each of these does:

  • Dockerfile ─ Build file for the service container, see Dockerfile section below for more details
  • <service name>.py ─ Contains main service code
  • service_manifest.yml ─ Service definition file, see Service manifest section below for more details

Service manifest

Every service must have a service_manifest.yml file in its root directory. The manifest file presents essential information about the service to the Assemblyline core system, information the system must have before it can run the service.

The diagram below shows all the elements that the manifest file can contain, including a brief description of each.

# Name of the service
name: ResultSample
# Version of the service
version: 1
# Description of the service
description: >
  ALv4 Result example service

  This service provides examples of how to:
     - define your service manifest
     - use the different section types
     - use tags
     - use heuristics to score sections
     - use the att&ck matrix
     - use the updater framework
     - define submission parameters
     - define service configuration parameters

# Regex defining the types of files the service accepts and rejects
accepts: .*
rejects: empty|metadata/.*

# At which stage the service should run (one of: FILTER, EXTRACT, CORE, SECONDARY, POST)
# NOTE: Stages are executed in the order defined in the list
stage: CORE
# Which category the service is part of (one of: Antivirus, Dynamic Analysis, External, Extraction, Filtering, Networking, Static Analysis)
category: Static Analysis

# Does the service require access to the file to perform its task
# If set to false, the service will only have access to the file metadata (e.g. Hashes, size, type, ...)
file_required: true
# Maximum execution time the service has before it's considered to be timed out
timeout: 60
# Does the service force the caching of results to be disabled
# (only use for service that will always provided different results each run)
disable_cache: false

# is the service enabled by default
enabled: true
# does the service make APIs call to other product not part of the assemblyline infrastructure (e.g. VirusTotal, ...)
is_external: false
# Number of concurrent services allowed to run at the same time
licence_count: 0

# service configuration block (dictionary of config variables)
# NOTE: The key names can be anything and the value can be of any types
config:
  str_config: value1
  int_config: 1
  list_config: [1, 2, 3, 4]
  bool_config: false

# submission params block: a list of submission param object that define parameters
#                          that the user can change about the service for each of its scans
# supported types: bool, int, str, list
submission_params:
  - default: ""
    name: password
    type: str
    value: ""
  - default: false
    name: extra_work
    type: bool
    value: false

# Service heuristic blocks: List of heuristics object that define the different heuristics used in the service
heuristics:
  - description: This the first Heuristic for ResultSample service.
    filetype: pdf
    heur_id: AL_RESULTSAMPLE_1
    name: Masks has PDF
    score: 100
    attack_id: T1001
  - description: This is second Heuristic for ResultSample service.
    filetype: exe
    heur_id: AL_RESULTSAMPLE_2
    name: Drops an exe
    score: 1000
  - description: This is third Heuristic for ResultSample service.
    filetype: exe
    heur_id: AL_RESULTSAMPLE_3
    name: Extraction information
    score: 0

# Docker configuration block which defines:
#  - the name of the docker container that will be created
#  - cpu and ram allocation by the container
docker_config:
  image: cccs/assemblyline-service-resultsample:latest
  cpu_cores: 1.0
  ram_mb_min: 128
  ram_mb: 256

# Update configuration block
update_config:
  # list of source object from where to fetch files for update and what will be the name of those files on disk
  sources:
    - uri: https://file-examples.com/wp-content/uploads/2017/02/zip_2MB.zip
      name: sample_2mb_file
    - uri: https://file-examples.com/wp-content/uploads/2017/02/zip_5MB.zip
      name: sample_5mb_file
  # intervale in seconds at which the updater runs
  update_interval_seconds: 300
  # Should the downloaded files be used to create signatures in the system
  generates_signatures: false

Dockerfile

A Dockerfile is required to build the service container that will be executed in the system.

The following items must be set for all services:

  • All services must be based on the cccs/assemblyline-v4-service-base:latest image
  • An environment variable must be set for the service path
  • Install any service requirements
  • Copy the service code into /opt/al/al_service/
FROM cccs/assemblyline-v4-service-base:latest

# Set the service path
ENV SERVICE_PATH result_sample.ResultSample

# By default, the base service container as the assemblyline user as the running user
#  switch to root to perform installation of dependancies
USER root

# See that we all these operations in one line to reduce
#  the number of container layers and size of the container
RUN apt-get update && apt-get install -y my_debian_apt_dependency_package && rm -rf /var/lib/apt/lists/*
RUN pip install --no-cache-dir --user my_pip_dependency && rm -rf ~/.cache/pip

# Change to the assemblyline user to make sure your service does not run as root
USER assemblyline

# Copy the service code in the service directory
WORKDIR /opt/al_service
COPY assemblyline_result_sample_service .

Testing an Assemblyline service

To test an Assemblyline service in standalone mode, the run_service_once.py script can be used to run a single task through the service for testing. That script does not require that you have a working version of Assemblyline installed, all you need are the Assemblyline python libraries.

Setting up dev environment

NOTE: The following environment setup has only been tested on Ubuntu 20.04.

  1. Install required packages

    sudo apt-get install build-essential libffi-dev python3.7 python3.7-dev python3-pip automake autoconf libtool
    
  2. Install Assemblyline v4 service package

    pip install --no-cache-dir --user assemblyline-v4-service
    
  3. Add your service development directory path (ie. /home/ubuntu/assemblyline-v4-service) to the PYTHONPATH environment variable

Using the run_service_once.py script

Steps

  1. Ensure the current working directory is the root of the service directory of the service to be run

    cd assemblyline-service-<service name>
    
  2. From a terminal, run the run_service_once script, where <service path> is the path to the service module and <file path> is the path of the file to be processed

    python3.9 -m assemblyline_v4_service.dev.run_service_once <service path> <file path>
    

    python3.7 -m assemblyline_v4_service.dev.run_service_once <service path> <file path>

    
    
    
  3. The output of the service (result.json and extracted/supplementary files) will be located in a directory where the input file is located

Example of running the ResultSample service

  1. Change working directory to root of the service:

    cd assemblyline_result_sample_service
    
  2. From a terminal, run the run_service_once script

    python3.9 -m assemblyline_v4_service.dev.run_service_once assemblyline_result_sample_service.result_sample.ResultSample /home/ubuntu/testfile.doc
    

    python3.7 -m assemblyline_v4_service.dev.run_service_once assemblyline_result_sample_service.result_sample.ResultSample /home/ubuntu/testfile.doc

    
    
    
  3. The results.json and any extracted/supplementary files will be outputted to /home/ubuntu/testfile_resultsample

Project details


Release history Release notifications | RSS feed