首页 > 其他分享 >ELK日志收集&&日志收集方案

ELK日志收集&&日志收集方案

时间:2023-05-08 12:45:27浏览次数:45  
标签:ELK filebeat 收集 tomcat name 日志 logstash log

31. ELK日志收集

日志分析系统 - k8s部署ElasticSearch集群 - 帝都攻城狮 - 博客园 (cnblogs.com)

https://blog.csdn.net/miss1181248983/article/details/113773943

31.1 日志收集方式

  1.node节点收集,基于daemonset部署日志收集进程,实现json-file类型(标准输出/dev/stdout、错误输出/dev/stderr)日志收集。
  2.使用sidcar容器(一个pod多容器)收集当前pod内一个或者多个业务容器的日志(通常基于emptyDir实现业务容器与sidcar之间的日志共亭)。
  3.在容器内置日志收集服务进程。

31.2 daemonset日志收集

logstach容器内收集-->kafka-zk-->logstach过滤写入-->ES-cluster

  • 把日志挂载到宿主机进行收集
  基于daemonset运行日志收集服务,主要收集以下类型日志:
  1.node节点收集,基于daemonset部署日志收集进程,实现json-file类型(标准输出/dev/stdout、错误输出/dev/stderr)日志收集,即应用程序产生的标准输出和错误输出的日志。
  因为容器里的日志都是输出到标准输出、错误输出,然后需要提前把容器里的日志驱动与日志类型改成jsonfile类型
  实现方式:
  将容器内的日志改好jsonfile之后挂载到宿主机,在把宿主机的日志挂载到logstash中进行过滤,这样就收集起来了
  • 宿主机系统日志等以日志文件形式保存的日志
对比类型containerddocker
日志存储路径 真实路径:/var/log/pods/CONTAINER_NAMEs #真实路径<br />软连接:同时kubelet也会在/var/log/containers目录下创建软链接指向/var/log/pods/CONTAINER_NAMEs #真实路径<br />软连接:同时kubelet也会在/var/log/containers目录下创建软链接指向/var/log/pods/CONTAINER_NAMEs #真实路径<br />软连接:同时kubelet也会在/var/log/containers目录下创建软链接指向/var/log/pods/CONTAINER_NAMES 真实路径:/var/lib/docker/containers/软连接会在和创建软连接指向软连接会在和创建软连接指向CONTAINERID<br/>软连接:kubelet会在/var/log/pods和/var/log/containers创建软连接指向/var/lib/docker/containers/CONTAINERID
日志配置参数 配置文件:/etc/systemd/system/kubelet.service
配置参数:
--container-log-max-files=5
--container-log-max-size="10OMi"
--logging-format="json"
配置文件:/etc/docker/daemon.json
参数:"log-driver" : "json-file",
"log-opts" :{
"max-file" : "5",
"max-size": "100m"
}
  • Dockfile
  root@k8s-master1:~1.logstash-image-Dockerfile# cat Dockerfile
  FROM logstash:7.12.1
   
  USER root
  WORKDIR /usr/share/logstash
  #RUN rm -rf config/logstash-sample.conf
  ADD logstash.yml /usr/share/logstash/config/logstash.yml
  ADD logstash.conf /usr/share/logstash/pipeline/logstash.conf
  • logstash.conf
  #收集日志的路径为宿主机
  root@k8s-master1:~1.logstash-image-Dockerfile# cat logstash.conf
  input {
  file {
  #这个是docker路径
  #path => "/var/lib/docker/containers/*/*-json.log" #docker
  #containerd路径
  path => "/var/log/pods/*/*/*.log"
  #如何之前有存在的日志就从头收集,默认是从结尾收集
  start_position => "beginning"
  #如果是containerd类型就加上jsonfile-daemonset-applog
  type => "jsonfile-daemonset-applog"
  }
   
  file {
  #把宿主机的系统日志也收集过来 在k8s YAML中定义
  path => "/var/log/*.log"
  start_position => "beginning"
  #如果是系统日志就加上这个类型jsonfile-daemonset-syslog
  type => "jsonfile-daemonset-syslog"
  }
  }
   
  output {
  if [type] == "jsonfile-daemonset-applog" {
  kafka {
  #k8s YAML中定义的KAFKA变量
  bootstrap_servers => "${KAFKA_SERVER}"
  #k8s YAML中定义的TOPIC_ID
  topic_id => "${TOPIC_ID}"
  batch_size => 16384 #logstash每次向ES传输的数据量大小,单位为字节
  #编码json
  codec => "${CODEC}"
  } }
   
  if [type] == "jsonfile-daemonset-syslog" {
  kafka {
  bootstrap_servers => "${KAFKA_SERVER}"
  topic_id => "${TOPIC_ID}"
  batch_size => 16384
  codec => "${CODEC}" #系统日志不是json格式
  }}
  }
  • logstash.yaml
  root@k8s-master1:~1.logstash-image-Dockerfile# cat logstash.yml
  http.host: "0.0.0.0"
  #注释掉这个地址-xpack是一个安全认证
  #xpack.monitoring.elasticsearch.hosts: [ "http://elasticsearch:9200" ]
  • build-command.sh
  root@k8s-master1:~1.logstash-image-Dockerfile# cat build-commond.sh
  #!/bin/bash
   
  #docker build -t harbor.nbrhce.com/baseimages/logstash:v7.12.1-json-file-log-v4 .
   
  #docker push harbor.nbrhce.com/baseimages/logstash:v7.12.1-json-file-log-v4
   
  nerdctl build -t harbor.nbrhce.com/baseimages/logstash:v7.12.1-json-file-log-v1 .
   
  nerdctl push harbor.nbrhce.com/baseimages/logstash:v7.12.1-json-file-log-v1
  • k8s YAML DaemonSet-logstash容器内收集
  root@k8s-master1:~/20220821/ELK/1.daemonset-logstash# cat 2.DaemonSet-logstash.yaml
  apiVersion: apps/v1
  kind: DaemonSet
  metadata:
  name: logstash-elasticsearch
  namespace: kube-system
  labels:
  k8s-app: logstash-logging
  spec:
  selector:
  matchLabels:
  name: logstash-elasticsearch
  template:
  metadata:
  labels:
  name: logstash-elasticsearch
  spec:
  tolerations:
  # this toleration is to have the daemonset runnable on master nodes
  # remove it if your masters can't run pods
  - key: node-role.kubernetes.io/master
  operator: Exists
  effect: NoSchedule
  containers:
  - name: logstash-elasticsearch
  image: harbor.nbrhce.com/baseimages/logstash:v7.12.1-json-file-log-v1
  env:
  - name: "KAFKA_SERVER"
  value: "172.31.4.101:9092,172.31.4.102:9092,172.31.4.103:9092"
  - name: "TOPIC_ID"
  value: "jsonfile-log-topic"
  - name: "CODEC"
  value: "json"
  # resources:
  # limits:
  # cpu: 1000m
  # memory: 1024Mi
  # requests:
  # cpu: 500m
  # memory: 1024Mi
  volumeMounts:
  - name: varlog #定义宿主机系统日志挂载路径
  mountPath: /var/log #宿主机系统日志挂载点
  - name: varlibdockercontainers #定义容器日志挂载路径,和logstash配置文件中的收集路径保持一直
  #mountPath: /var/lib/docker/containers #docker挂载路径
  mountPath: /var/log/pods #containerd挂载路径,此路径与logstash的日志收集路径必须一致
  readOnly: false
  terminationGracePeriodSeconds: 30
  volumes:
  #宿主机系统日志挂载logstash容器这样就能收集了
  - name: varlog
  hostPath:
  path: /var/log
  #宿主机containerd的日志挂载到logstash中
  - name: varlibdockercontainers
  hostPath:
  path: /var/lib/docker/containers
  path: /var/log/pods
  • logstach过滤日志 conf
  #这个是单独过滤日志的然后传给es集群
  root@k8s-master1:~1.daemonset-logstash# cat 3.logsatsh-daemonset-jsonfile-kafka-to-es.conf
  input {
  kafka {
  #kafka集群地址
  bootstrap_servers => "172.31.4.101:9092,172.31.4.102:9092,172.31.4.103:9092"
  #来自于哪个topics
  topics => ["jsonfile-log-topic"]
  #编码是json
  codec => "json"
  }
  }
   
  output {
  #if [fields][type] == "app1-access-log" {
  if [type] == "jsonfile-daemonset-applog" {
  elasticsearch {
  hosts => ["172.31.2.101:9200","172.31.2.102:9200"]
  #如果这个索引不存在那么会自动创建
  index => "jsonfile-daemonset-applog-%{+YYYY.MM.dd}"
  }}
   
  if [type] == "jsonfile-daemonset-syslog" {
  elasticsearch {
  hosts => ["172.31.2.101:9200","172.31.2.102:9200"]
  index => "jsonfile-daemonset-syslog-%{+YYYY.MM.dd}"
  }}
   
  }

31.3 Sidcar容器日志收集

  • 概述 轻量级日志收集容器
  使用sidcar容器一个pod多容器收集当前pod内一个或多个业务容器的日志、通常基于emptyDir实现业务容器与sidcar之间的日志共享
  容器之间的文件系统是隔离的,通常emptyDir来实现日志的共享,应该就是把业务容器的日志路径挂载到emptyDir,sidcar容器收集日志的路径就是这个emptyDir
  优点:这样收集日志的好处就是可以精细化服务的日志
  缺点:就是占用资源要是有旧业务容器还需要改造POD添加sidcar容器
  • Dockerfile制作镜像
  root@k8s-master1:~2.sidecar-logstash/1.logstash-image-Dockerfile# cat Dockerfile
  FROM logstash:7.12.1
   
  USER root
  WORKDIR /usr/share/logstash
  #RUN rm -rf config/logstash-sample.conf
  ADD logstash.yml /usr/share/logstash/config/logstash.yml
  ADD logstash.conf /usr/share/logstash/pipeline/logstash.conf
  • logstash.yaml
  root@k8s-master1:~2.sidecar-logstash/1.logstash-image-Dockerfile# cat logstash.yml
  http.host: "0.0.0.0"
  #xpack.monitoring.elasticsearch.hosts: [ "http://elasticsearch:9200" ]
  • logstash.conf
  root@k8s-master1:~2.sidecar-logstash/1.logstash-image-Dockerfile# cat logstash.conf
  input {
  file {
  path => "/var/log/applog/catalina.out"
  start_position => "beginning"
  type => "app1-sidecar-catalina-log"
  }
  file {
  path => "/var/log/applog/localhost_access_log.*.txt"
  start_position => "beginning"
  type => "app1-sidecar-access-log"
  }
  }
   
  output {
  if [type] == "app1-sidecar-catalina-log" {
  kafka {
  bootstrap_servers => "${KAFKA_SERVER}"
  topic_id => "${TOPIC_ID}"
  batch_size => 16384 #logstash每次向ES传输的数据量大小,单位为字节
  codec => "${CODEC}"
  } }
   
  if [type] == "app1-sidecar-access-log" {
  kafka {
  bootstrap_servers => "${KAFKA_SERVER}"
  topic_id => "${TOPIC_ID}"
  batch_size => 16384
  codec => "${CODEC}"
  }}
  }
  • tomcat.yaml
  root@k8s-master1:~/20220821/ELK/2.sidecar-logstash# cat 2.tomcat-app1.yaml
  kind: Deployment
  #apiVersion: extensions/v1beta1
  apiVersion: apps/v1
  metadata:
  labels:
  app: magedu-tomcat-app1-deployment-label
  name: magedu-tomcat-app1-deployment #当前版本的deployment 名称
  namespace: magedu
  spec:
  replicas: 3
  selector:
  matchLabels:
  app: magedu-tomcat-app1-selector
  template:
  metadata:
  labels:
  app: magedu-tomcat-app1-selector
  spec:
  containers:
  - name: sidecar-container
  image: harbor.magedu.net/baseimages/logstash:v7.12.1-sidecar
  imagePullPolicy: IfNotPresent
  #imagePullPolicy: Always
  #将传递参数给kafka
  env:
  - name: "KAFKA_SERVER"
  value: "172.31.4.101:9092,172.31.4.102:9092,172.31.4.103:9092"
  - name: "TOPIC_ID"
  value: "tomcat-app1-topic"
  - name: "CODEC"
  value: "json"
  #挂载到容器里这个路径--配置文件与其对应这个路径
  volumeMounts:
  - name: applogs
  mountPath: /var/log/applog
  - name: magedu-tomcat-app1-container
  image: registry.cn-hangzhou.aliyuncs.com/zhangshijie/tomcat-app1:v1
  imagePullPolicy: IfNotPresent
  #imagePullPolicy: Always
  ports:
  - containerPort: 8080
  protocol: TCP
  name: http
  env:
  - name: "password"
  value: "123456"
  - name: "age"
  value: "18"
  resources:
  limits:
  cpu: 1
  memory: "512Mi"
  requests:
  cpu: 500m
  memory: "512Mi"
  volumeMounts:
  - name: applogs
  mountPath: /apps/tomcat/logs
  startupProbe:
  httpGet:
  path: /myapp/index.html
  port: 8080
  initialDelaySeconds: 5 #首次检测延迟5s
  failureThreshold: 3 #从成功转为失败的次数
  periodSeconds: 3 #探测间隔周期
  readinessProbe:
  httpGet:
  #path: /monitor/monitor.html
  path: /myapp/index.html
  port: 8080
  initialDelaySeconds: 5
  periodSeconds: 3
  timeoutSeconds: 5
  successThreshold: 1
  failureThreshold: 3
  livenessProbe:
  httpGet:
  #path: /monitor/monitor.html
  path: /myapp/index.html
  port: 8080
  initialDelaySeconds: 5
  periodSeconds: 3
  timeoutSeconds: 5
  successThreshold: 1
  failureThreshold: 3
  volumes:
  - name: applogs #定义通过emptyDir实现业务容器与sidecar容器的日志共享,以让sidecar收集业务容器中的日志
  emptyDir: {}

31.4 filebeat容器内置进程收集

  • Dockerfile 在做业务镜像的时候添加进去
  root@k8s-master1:~/20220821/ELK/3.container-filebeat-process/1.webapp-filebeat-image-Dockerfile# cat Dockerfile
  #tomcat web1
  FROM harbor.magedu.net/pub-images/tomcat-base:v8.5.43
   
  ADD catalina.sh /apps/tomcat/bin/catalina.sh
  ADD server.xml /apps/tomcat/conf/server.xml
  #ADD myapp/* /data/tomcat/webapps/myapp/
  ADD myapp.tar.gz /data/tomcat/webapps/myapp/
  ADD run_tomcat.sh /apps/tomcat/bin/run_tomcat.sh
  ADD filebeat.yml /etc/filebeat/filebeat.yml
  RUN chown -R tomcat.tomcat /data/ /apps/
  #ADD filebeat-7.5.1-x86_64.rpm /tmp/
  #RUN cd /tmp && yum localinstall -y filebeat-7.5.1-amd64.deb
   
  EXPOSE 8080 8443
   
  CMD ["/apps/tomcat/bin/run_tomcat.sh"]
  • filebeat配置文件
  root@k8s-master1:~1.webapp-filebeat-image-Dockerfile# cat filebeat.yml
  #采集日志
  filebeat.inputs:
  - type: log
  #这个enabled是启用这段配置、不是true就不会加载
  enabled: true
  paths:
  #收集业务容器日志-运行日志
  - /apps/tomcat/logs/catalina.out
  fields:
  #定义的类型与名字
  type: filebeat-tomcat-catalina
  - type: log
  #在定义一个类型访问日志
  enabled: true
  paths:
  - /apps/tomcat/logs/localhost_access_log.*.txt
  fields:
  type: filebeat-tomcat-accesslog
  #这里是默认的配置文件 可以不用动
  filebeat.config.modules:
  path: ${path.config}/modules.d/*.yml
  reload.enabled: false
  setup.template.settings:
  index.number_of_shards: 1
  setup.kibana:
   
  #这里是输出到哪里
  output.kafka:
  hosts: ["172.31.4.101:9092"]
  #确认ack保证数据完整性
  required_acks: 1
  #写的kafka中的topic
  topic: "filebeat-magedu-app1"
  #开启压缩节省带宽但是占CPU
  compression: gzip
  #最大字节不能超过这个值
  max_message_bytes: 1000000
  #output.redis:
  # hosts: ["172.31.2.105:6379"]
  # key: "k8s-magedu-app1"
  # db: 1
  # timeout: 5
  # password: "123456"
  • 运行命令
  root@k8s-master1:~1.webapp-filebeat-image-Dockerfile# cat run_tomcat.sh
  #!/bin/bash
  #echo "nameserver 223.6.6.6" > /etc/resolv.conf
  #echo "192.168.7.248 k8s-vip.example.com" >> /etc/hosts
   
  /usr/share/filebeat/bin/filebeat -e -c /etc/filebeat/filebeat.yml -path.home /usr/share/filebeat -path.config /etc/filebeat -path.data /var/lib/filebeat -path.logs /var/log/filebeat &
  su - tomcat -c "/apps/tomcat/bin/catalina.sh start"
  tail -f /etc/hosts
  • k8s filebeat 账号
  #如果你是通过daemset部署filebeat那么是需要授权的但是目前的filebeat是在pod中运行的这个服务账号可以先不执行
  root@k8s-master1:~3.container-filebeat-process# cat 2.filebeat-serviceaccount.yaml
  ---
  apiVersion: rbac.authorization.k8s.io/v1
  kind: ClusterRole
  metadata:
  name: filebeat-serviceaccount-clusterrole
  labels:
  k8s-app: filebeat-serviceaccount-clusterrole
  rules:
  - apiGroups: [""] # "" indicates the core API group
  resources:
  - namespaces
  - pods
  - nodes
  verbs:
  - get
  - watch
  - list
   
  ---
  apiVersion: rbac.authorization.k8s.io/v1
  kind: ClusterRoleBinding
  metadata:
  name: filebeat-serviceaccount-clusterrolebinding
  subjects:
  - kind: ServiceAccount
  name: default
  namespace: magedu
  roleRef:
  kind: ClusterRole
  name: filebeat-serviceaccount-clusterrole
  apiGroup: rbac.authorization.k8s.io
  • YAML
  root@k8s-master1:~3.container-filebeat-process# cat 3.tomcat-app1.yaml
  kind: Deployment
  #apiVersion: extensions/v1beta1
  apiVersion: apps/v1
  metadata:
  labels:
  app: magedu-tomcat-app1-filebeat-deployment-label
  name: magedu-tomcat-app1-filebeat-deployment
  namespace: magedu
  spec:
  replicas: 1
  selector:
  matchLabels:
  app: magedu-tomcat-app1-filebeat-selector
  template:
  metadata:
  labels:
  app: magedu-tomcat-app1-filebeat-selector
  spec:
  containers:
  - name: magedu-tomcat-app1-filebeat-container
  image: harbor.magedu.net/magedu/tomcat-app1:v1-filebeat
  imagePullPolicy: IfNotPresent
  #imagePullPolicy: Always
  ports:
  - containerPort: 8080
  protocol: TCP
  name: http
  env:
  - name: "password"
  value: "123456"
  - name: "age"
  value: "18"
  resources:
  limits:
  cpu: 1
  memory: "512Mi"
  requests:
  cpu: 500m
  memory: "512Mi"
  • service.yaml
  #做测试
  root@k8s-master1:~3.container-filebeat-process# cat 4.tomcat-service.yaml
  ---
  kind: Service
  apiVersion: v1
  metadata:
  labels:
  app: magedu-tomcat-app1-filebeat-service-label
  name: magedu-tomcat-app1-filebeat-service
  namespace: magedu
  spec:
  type: NodePort
  ports:
  - name: http
  port: 80
  protocol: TCP
  targetPort: 8080
  nodePort: 30092
  selector:
  app: magedu-tomcat-app1-filebeat-selector
  • logstash 的配置文件传给ES
  root@k8s-master1:~3.container-filebeat-process# cat 5.logstash-filebeat-process-kafka-to-es.conf
  input {
  kafka {
  bootstrap_servers => "172.31.4.101:9092,172.31.4.102:9092,172.31.4.103:9092"
  topics => ["filebeat-magedu-app1"]
  codec => "json"
  }
  }
   
  output {
  if [fields][type] == "filebeat-tomcat-catalina" {
  elasticsearch {
  hosts => ["172.31.2.101:9200","172.31.2.102:9200"]
  index => "filebeat-tomcat-catalina-%{+YYYY.MM.dd}"
  }}
   
  if [fields][type] == "filebeat-tomcat-accesslog" {
  elasticsearch {
  hosts => ["172.31.2.101:9200","172.31.2.102:9200"]
  index => "filebeat-tomcat-accesslog-%{+YYYY.MM.dd}"
  }}
   
  }

标签:ELK,filebeat,收集,tomcat,name,日志,logstash,log
From: https://www.cnblogs.com/gaoyuechen/p/17381367.html

相关文章

  • SpringBoot添加日志
    SpringBoot添加日志前言SpringBoot使用ApacheCommons日志记录进行所有内部日志记录。SpringBoot的默认配置支持使用JavaUtilLogging,Log4j2和Logback。使用这些,可以配置控制台日志记录以及文件日志记录。如果使用的是SpringBootStarters,Logback将为日志记录提供良好的支......
  • burp的三种日志格式
    1、xml格式(1)items作为根节点,具有burpVersion、exportTime两个属性,分别表示burp版本和导出时间(2)item作为itmes的子节点,表示一组请求-响应(3)item内包括多个字节点:time,表示时间,cst格式url,host,例如static.deepl.com,该节点还具有一个属性ipportprotocol,例如httpsmethod......
  • 【Azure 应用服务】Azure JS Function 异步方法中执行SQL查询后,Callback函数中日志无
    问题描述开发AzureJSFunction(NodeJS),使用mssql组件操作数据库。当SQL语句执行完成后,在Callback函数中执行日志输出 context.log("..."),遇见如下错误:Warning:Unexpectedcallto'log'onthecontextobjectafterfunctionexecutionhascompleted.Pleasecheck......
  • kubernetes|EFK日志系统
    前言对于任何基础设施或后端服务系统,日志都是极其重要的,借助日志可以分析程序的运行状态、用户的操作行为等。最早常说的日志监控系统是ELK,即ElasticSearch(负责数据检索)、Logstash(负责数据收集)、Kibana(负责数据展示)三个软件的组合,随着技术的发展,又出现了很多新的名词,比如EFK,......
  • wazuh正则规则,匹配没有用公司打印机打印的日志
     <!--检查使用公司外部打印机打印的行为--><groupname="天擎"><ruleid="100020"level="5"><decoded_as>json</decoded_as><description>TianQing</description><fieldnam......
  • 利用Navicat的历史日志查询表的索引信息(还可以查询很多系统级别的信息)
    1、使用前提所有的能用Navicat连接的数据库都可以使用这个方法DDL/DML语句都有2、Navicat中的历史日志3、比如查询mysql的表的索引先打开“历史记录”选择一张表-设计表查看“历史记录”,点击“暂停”......
  • webservie 客户端读取服务器端日志例子(以网页展现)
    importjava.io.BufferedInputStream;下面是一个完整的servlet,直接复制它既可以使用,只需要修改红色部分路径即可,本例使用方法:在浏览器直接键入URL:即可展现日志,如下:http://localhost:8888/BPMDemo/BPMClientLogService?point=p1代码:importjava.io.BufferedReader;imp......
  • 日志AOP
    @Target(ElementType.METHOD)//注解放置的目标位置,METHOD是可注解在方法级别上@Retention(RetentionPolicy.RUNTIME)//注解在哪个阶段执行@Documentedpublic@interfaceOperLog{StringoperModul()default"";//操作模块OperLogTypeEnumoperType();//操作类......
  • 自动化框架——日志模块
    日志模块的使用(python的logging模块)一:简单使用   学习原因:学习logging模块是为了更直观的调试代码,虽然有prin语句可以调试,但是在批量执行自动化用例时需要logging模块来调试。如何简单使用:导入日志模块设置basicConfig就行,设置日志级别,日志格式,日志写入模式,日志文件名就可......
  • SLS日志查询遇到的一些问题
    SLS日志查询遇到的一些问题根据执行时间查询结果不准确的问题原因:索引类型造成的;解决:进入索引设置,改为double即可;注意,只对更改后的日志生效,之前的旧日志不生效;一些常用查询语句查询执行时间大于5秒的*and__topic__:访问记录日志andoperation_hours>5查询平均执行时......