
[k8s provider] Look up metadata based on file path #5164

Open
ycombinator opened this issue Jul 17, 2024 · 2 comments · May be fixed by #6583 or elastic/integrations#12500
Assignees
Labels
enhancement New feature or request

Comments

@ycombinator
Contributor

Describe the enhancement:

Have the Kubernetes provider look up metadata based on file path, similar to what the add_kubernetes_metadata provider in Beats is capable of doing.
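For reference, a minimal sketch of that Beats processor, using its logs_path matcher to map a harvested file path back to a pod and container (the paths shown are the conventional kubelet locations; adjust for the cluster at hand):

```yaml
processors:
  - add_kubernetes_metadata:
      host: ${NODE_NAME}
      matchers:
        - logs_path:
            logs_path: "/var/log/containers/"
```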

Describe a specific use case for the enhancement or feature:

From @henrikno:

We have this bug where we lose logs from short lived containers (e.g. if they're crashing). It seems to be like it happens when there's some delay in autodiscover, i.e. by the time it has discovered the container via k8s, and updated the policy to filebeat, the file has been rotated to the new "attempt", which has a new container id.
So I was looking if it's possible to "reverse" it, and look for the files, and look up metadata based on the file path. And that looks like what add_kubernetes_metadata is doing.

@ycombinator added the enhancement (New feature or request) label Jul 17, 2024
@pchila
Member

pchila commented Jan 23, 2025

I had a look at the kubernetes provider and the inputs generated to ingest container logs, and there may be a simpler solution to ensure that the agent generates filestream inputs that match rotated container log files for containers that crash or terminate quickly.

Looking at the elastic-agent standalone k8s manifest, the agent creates an input per (pod, container) pair and then one datastream per container (it's the same in the kustomize files):

      # Input ID allowing Elastic Agent to track the state of this input. Must be unique.
      - id: container-log-${kubernetes.pod.name}-${kubernetes.container.id}
        type: filestream
        use_output: default
        meta:
          package:
            name: kubernetes
            version: 1.52.0
        data_stream:
          namespace: default
        streams:
          # Stream ID for this data stream allowing Filebeat to track the state of the ingested files. Must be unique.
          # Each filestream data stream creates a separate instance of the Filebeat filestream input.
          - id: container-log-${kubernetes.pod.name}-${kubernetes.container.id}
            data_stream:
              dataset: kubernetes.container_logs
              type: logs
            prospector.scanner.symlinks: true
            parsers:
              - container: ~
              # - ndjson:
              #     target: json
              # - multiline:
              #     type: pattern
              #     pattern: '^\['
              #     negate: true
              #     match: after
            paths:
              - /var/log/containers/*${kubernetes.container.id}.log

In the elastic-agent helm chart, a single input is created with one datastream per container ID (modeled after what the k8s integration does for fleet-managed agents):

      - id: filestream-container-logs
        type: filestream
        data_stream:
          namespace: {{ .Values.kubernetes.namespace }}
        use_output: {{ .Values.kubernetes.output }}
        streams:
          - id: kubernetes-container-logs-${kubernetes.pod.name}-${kubernetes.container.id}
            data_stream:
              dataset: kubernetes.container_logs
            paths:
              - '/var/log/containers/*${kubernetes.container.id}.log'
            prospector.scanner.symlinks: {{ dig "vars" "symlinks" true .Values.kubernetes.containers.logs }}
            parsers:
              - container:
                  stream: {{ dig "vars" "stream" "all" .Values.kubernetes.containers.logs }}
                  format: {{ dig "vars" "format" "auto" .Values.kubernetes.containers.logs }}
            {{- with $.Values.kubernetes.containers.logs.additionalParsersConfig }}
            {{ . | toYaml | nindent 6 }}
            {{- end }}
            processors:

The files in /var/log/containers are symlinks to the actual container logs under /var/log/pods/<namespace>_<pod name>_<pod_uid>/<container name>/*.log.

For example, for pod fritzy-printer-6d65dcc658-mpzpr in namespace default, the logs for the echoer container are:

root@fastcontainers-control-plane:/var/log# ll containers/fritzy-printer-*.log
lrwxrwxrwx 1 root root 104 Jan 23 12:23 containers/fritzy-printer-6d65dcc658-mpzpr_default_echoer-3b25af57297705d7c0986df8e4817e1225c212d5df6dad27104b17357de92c9e.log -> /var/log/pods/default_fritzy-printer-6d65dcc658-mpzpr_67f3da94-d6d0-47b7-a60b-ea715174927c/echoer/56.log

The directory /var/log/pods/default_fritzy-printer-6d65dcc658-mpzpr_67f3da94-d6d0-47b7-a60b-ea715174927c/echoer/ contains all the available logs for the same container (even the rotated ones from previous container runs until they are deleted by kubelet).
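The difference between the two globs can be sketched with a simulated kubelet log layout (all names below are hypothetical, made up for illustration):

```shell
# Simulate the kubelet log directories for one pod with one crashed run.
dir=$(mktemp -d)
pod_dir="$dir/var/log/pods/default_mypod_1111-2222/app"   # hypothetical pod uid
mkdir -p "$pod_dir" "$dir/var/log/containers"
touch "$pod_dir/0.log" "$pod_dir/1.log"                   # 0.log is from a previous (crashed) run
ln -s "$pod_dir/1.log" "$dir/var/log/containers/mypod_default_app-aaaa.log"

# The per-container-id glob in /var/log/containers sees only the current attempt:
ls "$dir"/var/log/containers/*aaaa.log | wc -l            # prints 1

# The pod-directory glob sees every attempt still on disk:
ls "$pod_dir"/*.log | wc -l                               # prints 2
```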

In order to capture all the available logs, it should be enough to change the paths in the container logs input, creating a single input and one stream per (pod, container name) pair, something similar to the YAML below:

      # Input ID allowing Elastic Agent to track the state of this input. Must be unique.
      - id: filestream-container-logs
        type: filestream
        use_output: default
        ...
        streams:
          # Stream ID for this data stream allowing Filebeat to track the state of the ingested files. Must be unique.
          # Each filestream data stream creates a separate instance of the Filebeat filestream input.
          - id: container-log-${kubernetes.pod.namespace}-${kubernetes.pod.name}-${kubernetes.container.id}
            data_stream:
              dataset: kubernetes.container_logs
              type: logs
            parsers:
              - container: ~
              # - ndjson:
              #     target: json
              # - multiline:
              #     type: pattern
              #     pattern: '^\['
              #     negate: true
              #     match: after
            paths:
              - /var/log/pods/${kubernetes.pod.namespace}_${kubernetes.pod.name}_${kubernetes.pod.uid}/${kubernetes.container.name}/*.log

Note the addition of ${kubernetes.pod.namespace} in the stream ID to ensure that the agent can distinguish between pods in different namespaces that share the same pod name and container name (this could happen with pods that have predictable names, such as those created by StatefulSets).

I am going to modify the elastic-agent helm chart to check that the new input spec works as expected.

Huge props to @pkoutsovasilis, who assisted me in the analysis of the kubernetes provider and suggested looking into adjusting the paths passed to Filebeat.

@pchila
Member

pchila commented Jan 24, 2025

@henrikno Could you please have a look at my comment above and check whether the proposed solution would help with the rotated container logs?
This solution has a much smaller impact than adding a different mechanism for discovering pods, and it should still ingest all container log files available on the node.
