yawyd313 e4538f3d9b 2025 | 5 days ago | |
---|---|---|
argo-helm | 11 months ago | |
argo-workflow | 1 month ago | |
bak_kafka | 1 month ago | |
bak_kafka_ha | 1 month ago | |
certmanager | 11 months ago | |
ch | 5 days ago | |
ch-cold | 1 month ago | |
ch-keeper | 11 months ago | |
ch-single | 1 month ago | |
chaos-mesh | 1 month ago | |
ck-cold | 1 year ago | |
ck-ui | 11 months ago | |
clickhouse | 5 days ago | |
clickhouse_etc | 1 year ago | |
createcert | 1 year ago | |
data | 5 days ago | |
dataroom | 1 month ago | |
deepflow | 1 month ago | |
dev | 11 months ago | |
dinky | 11 months ago | |
docker-registry | 1 month ago | |
droneci | 1 month ago | |
es | 1 year ago | |
flink | 11 months ago | |
flink-kubernetes-operator | 11 months ago | |
fluentd | 1 year ago | |
gogs | 5 days ago | |
grafana | 1 month ago | |
halo | 1 month ago | |
hdfs | 1 month ago | |
i6000pusher | 5 days ago | |
ingress-nginx | 1 month ago | |
jaeger | 1 year ago | |
jenkins | 11 months ago | |
kafka | 5 days ago | |
kafka-delay | 11 months ago | |
kafka-ui | 5 days ago | |
kafka2rmq | 5 days ago | |
kafka_bitnami | 1 year ago | |
kibana | 1 year ago | |
loki-distributed | 11 months ago | |
metallb | 1 month ago | |
metric-server | 11 months ago | |
minio | 11 months ago | |
mysql | 5 days ago | |
mysql.bak | 1 month ago | |
neo4j-helm | 11 months ago | |
network-policy | 1 year ago | |
nginx | 1 year ago | |
nginxgatewayfabric | 10 months ago | |
ngx-proxy | 1 month ago | |
oas-session | 1 month ago | |
ob-agent-stream | 5 days ago | |
ob-event | 1 month ago | |
ob-proxy | 5 days ago | |
ob-server | 5 days ago | |
ob-server_bak | 1 year ago | |
ob-tools | 1 month ago | |
ob2 | 11 months ago | |
obui | 1 month ago | |
openebs | 11 months ago | |
operator | 1 month ago | |
otel | 5 days ago | |
otel-data-replay | 1 month ago | |
prometheus | 1 month ago | |
pycli | 11 months ago | |
rbac | 11 months ago | |
redis | 11 months ago | |
rocketmq | 5 days ago | |
rook | 11 months ago | |
scripts | 11 months ago | |
superset | 5 days ago | |
swagger-ui | 5 days ago | |
tempo | 1 year ago | |
tempo-distributed | 11 months ago | |
test-oas | 1 month ago | |
timescaledb | 5 days ago | |
trace-stream-creator | 1 month ago | |
victoria-metrics-cluster | 1 month ago | |
xxl-jobs | 1 month ago | |
.gitignore | 11 months ago | |
README.md | 1 year ago | |
check.sh | 11 months ago | |
ingress-ceph-controller.yaml | 11 months ago | |
quota-observe.yaml | 1 year ago | |
storageClass_local.yaml | 11 months ago | |
storageClass_tmp.yaml | 11 months ago |
[TOC]
使用helm chart/deploy/ingress-nginx
安装
helm install ingress-nginx . -n ingress-nginx
如80、443端口不可用,修改contorller.hostPort
修改controller.ingressResource.enable/default
为true
如node
很多,将controller.kind
改为deployment ,并设置nodeSelector,不然ingress解析不到
kubectl apply -f deploy/openebs/openebs-operator.yaml
deploy/clickhouse
global.storageClass
shard
和replica
auth.username
, auth.password
ingress
修改persistence.storangeClass: openebs-hostpath
helm install clickhouse . -n obse部署nginx ingress controller(如有跳过)部署nginx ingress controller(如有跳过)部署nginx ingress controller(如有跳过)rve --create-namespace
使用deploy/tempo
开启otel receiver
otlp:
protocols:
grpc:
endpoint: "0.0.0.0:4317"
http:
endpoint: "0.0.0.0:4318"
配置metrics_generator_processors
metrics_generator_processors:
- 'service-graphs'
- 'span-metrics'
max_search_bytes_per_trace: 0
配置tempo.metricGenerator
, 指向部署的prometheus
metricsGenerator:
enabled: true
remoteWriteUrl: "http://prometheus-server.observe.svc.cluster.local:80/api/v1/write"
配置存储方式
persistence:
enabled: true
storageClassName: openebs-hostpath
执行部署
helm install tempo . -n observe
使用deploy/prometheus
修改persistentVolume.storageClass
执行部署
helm install prometheus . -n observe
确定部署mode
配置exportor
otlp:
endpoint: "tempo.observe.svc.cluster.local:4317"
tls:
insecure: true
prometheus:
endpoint: "0.0.0.0:8889"
#namespace: default
clickhouse:
endpoint: "tcp://clickhouse-headless.observe.svc.cluster.local:9000?dial_timeout=10s&compress=lz4"
database: otel
username: default
password: "cecf@cestong.com"
ttl_days: 10
#logs_table: otel_logs
#traces_table: otel_traces
#metrics_table: otel_metrics
timeout: 5s
retry_on_failure:
enabled: true
initial_interval: 5s
max_interval: 30s
max_elapsed_time: 300s
开启otel receiver
otlp:
protocols:
grpc:
endpoint: ${MY_POD_IP}:4317
http:
endpoint: ${MY_POD_IP}:4318
配置pipeline
,把otel
收到的trace
输出至clickhouse
和tempo ( otel )
配置pipeline
,把otel
收到的metrics
输出至clickhouse
和prometheus
配置podAnnotations
,让prometheus
自动来采集收集到的metrics
podAnnotations:
prometheus.io/scrape: "true"
prometheus.io/path: /metrics
prometheus.io/port: "8889"
执行部署
helm install otel-collector . -n observe
部署mysql
git地址:https://git.cestong.com.cn/cecf/cluster-config
所有配置已经修改好,直接部署即可
cd mysql
helm -n observe install mysql .
注意 当前配置使用的是NodePort向外暴露端口,端口固定为30306,客户环境可能需要更换,更换位置如下
# values.yaml文件 477行
476 nodePorts:
477 mysql: "30306"
初始化sql
# 获取mysql root密码
MYSQL_ROOT_PASSWORD=$(kubectl get secret --namespace observe mysql -o jsonpath="{.data.mysql-root-password}" | base64 -d)
# 进入到同一个命名空间的mysql客户端的pod中
kubectl run mysql-client --rm --tty -i --restart='Never' --image docker.io/bitnami/mysql:5.7.42-debian-11-r27 --namespace observe --env MYSQL_ROOT_PASSWORD=$MYSQL_ROOT_PASSWORD --command -- bash
# 登录mysql
mysql -h mysql.observe.svc.cluster.local -uroot -p"$MYSQL_ROOT_PASSWORD"
# 选择数据库
use otel;
# 执行sql语句
部署数据清洗程序
部署方式
kubectl -n observe apply -f cronjob.yaml
注意 要修改cronjob.yaml中 mysql 和 clickhouse 的用户及权限
修改ingress
,配置对应的域名
配置persistence.storageClassName
设置adminPassword
执行部署
helm install grafana . -n observe
安装clickhouse
插件
设置tempo
, prometheus
, clickhouse
数据源
导入dashboard
更改front/ui中的域名
编译打包
执行部署deploy/obui
kubectl apply -f deployment-front.yaml
kubectl apply -f deployment.yaml
kubectl apply -f ingress_rewrite.yaml
kubectl apply -f ingress.yaml
kubectl apply -f svc-front.yaml
kubectl apply -f svc.yaml
配置OTEL_COLLECTOR_NAME
指向部署的opentelemetry-collector
执行部署
helm install otel-demo . -n observe
在目标监控程序中增加Java参数
- '-javaagent:/sidecar/agent/opentelemetry-javaagent.jar'
- '-Dotel.resource.attributes=service.name=item-svc'
- '-Dotel.traces.exporter=otlp'
- '-Dotel.metrics.exporter=otlp'
deploy/obui
下的配置文件生成ingress-nginx
绑定的主机ip, 将域名绑定到这些ip服务 | CPU | Mem | 存储 |
---|---|---|---|
clickhouse | 4 | 8 | 200G |
tempo | 4 | 8 | 200G |
otel-collector | 3 | 6 | 0 |
prometheus | 2 | 4 | 100G |
grafana | 2 | 4 | 30G |
obui/front | 2 | 2 | 0 |
otel-demo | 4 | 8 | 0 |
总计 | 21核 | 40G | 530G |
服务 | CPU | Mem | 存储 |
---|---|---|---|
clickhouse | 4 | 8 | 200G |
clickhouse-zookeeper | 0.25/0.5 | ||
tempo | 4 | 8 | 200G |
otel-collector | 3 | 6 | 0 |
prometheus | 2 | 4 | 100G |
grafana | 2 | 4 | 30G |
obui/front | 2 | 2 | 0 |
otel-demo | 4 | 8 | 0 |
总计 | 21核 | 40G | 530G |
租户名称: observe
[x] docker hub的地址,push
[x] ingress class和storageclass名称
[ ] 域名转发到ingress上,ingress创建不了
[ ] prometheus mustFromJson