Link Search Menu Expand Document Documentation Menu

substringBeforeLast()

The substringBeforeLast() function is used to extract the portion of a string that precedes the last occurrence of a specified delimiter. It takes two arguments:

  1. The first argument is either a literal string or a JSON pointer that represents the source string.

  2. The second argument is the delimiter string to search for within the first argument.

If the delimiter is found, the function returns the portion of the string before the last occurrence of the delimiter. If the delimiter is not found, the original string is returned. If the source resolves to null, the function returns null. If the delimiter is null or empty, the original string is returned.

For example, to remove the file extension from a filename field, use the substringBeforeLast() function as follows:

'substringBeforeLast(/filename, ".")'

If the /filename field contains archive.tar.gz, the function returns archive.tar.

Alternatively, you can use a literal string as the first argument:

'substringBeforeLast("one-two-three", "-")'

The function returns one-two because it extracts the portion of the string before the last - character.

The substringBeforeLast() function performs a case-sensitive search.

Example

The following pipeline uses the substringBeforeLast() function to extract the directory path from a full file path. It adds the extracted directory path as a new field called directory:

substring-before-last-demo:
  source:
    http:
      ssl: false

  processor:
    - add_entries:
        entries:
          - key: directory
            value_expression: 'substringBeforeLast(/filepath, "/")'

  sink:
    - opensearch:
        hosts: ["https://opensearch:9200"]
        insecure: true
        username: admin
        password: admin_password
        index_type: custom
        index: demo-index-%{yyyy.MM.dd}

You can test the pipeline using the following command:

curl -sS -X POST "http://localhost:2021/log/ingest" \
  -H "Content-Type: application/json" \
  -d '[
        {"filepath":"/var/log/syslog"},
        {"filepath":"/home/user/docs/report.pdf"}
      ]'

The documents stored in OpenSearch contain the following information:

{
  ...
  "hits": {
    "total": {
      "value": 2,
      "relation": "eq"
    },
    "max_score": 1,
    "hits": [
      {
        "_index": "demo-index-2026.03.13",
        "_id": "abc123",
        "_score": 1,
        "_source": {
          "filepath": "/var/log/syslog",
          "directory": "/var/log"
        }
      },
      {
        "_index": "demo-index-2026.03.13",
        "_id": "def456",
        "_score": 1,
        "_source": {
          "filepath": "/home/user/docs/report.pdf",
          "directory": "/home/user/docs"
        }
      }
    ]
  }
}
350 characters left

Have a question? .

Want to contribute? or .