Install insight-agent in Suanova 4.0¶
In AI platform, previous Suanova 4.0 can be accessed as a subcluster. This guide provides potential issues and solutions when installing insight-agent in a Suanova 4.0 cluster.
Issue One¶
Since most Suanova 4.0 clusters have installed dx-insight as the monitoring system, installing insight-agent at this time will conflict with the existing prometheus operator in the cluster, making it impossible to install smoothly.
Solution¶
Enable the parameters of the prometheus operator, retain the prometheus operator in dx-insight, and make it compatible with the prometheus operator in insight-agent in 5.0.
Steps¶
- Log in to the console.
- Enable the --deny-namespaces parameter in the two prometheus operators respectively.
-
Run the following command (the following command is for reference only, the actual command needs to replace the prometheus operator name and namespace in the command).
Note
- As shown in the figure above, the dx-insight component is deployed under the dx-insight tenant, and the insight-agent is deployed under the insight-system tenant. Add --deny-namespaces=insight-system in the prometheus operator in dx-insight, Add --deny-namespaces=dx-insight in the prometheus operator in insight-agent.
- Just add deny namespace, both prometheus operators can continue to scan other namespaces, and the related collection resources under kube-system or customer business namespaces are not affected.
- Please pay attention to the problem of node exporter port conflict.
Supplementary Explanation¶
The open-source node-exporter turns on hostnetwork by default and the default port is 9100. If the monitoring system of the cluster has installed node-exporter , then installing insight-agent at this time will cause node-exporter port conflict and it cannot run normally.
Note
Insight's node exporter will enable some features to collect special indicators, so it is recommended to install.
Currently, it does not support modifying the port in the installation command. After helm install insight-agent , you need to manually modify the related ports of the insight node-exporter daemonset and svc.
Issue Two¶
After Insight Agent is successfully deployed, fluentbit does not collect logs of Suanova 4.0.
Solution¶
The docker storage directory of Suanova 4.0 is /var/lib/containers , which is different from the path in the configuration of insigh-agent, so the logs are not collected.
Steps¶
- Log in to the console.
-
Modify the following parameters in the insight-agent Chart.
fluent-bit: daemonSetVolumeMounts: - name: varlog mountPath: /var/log - name: varlibdockercontainers - mountPath: /var/lib/docker/containers + mountPath: /var/lib/containers/docker/containers readOnly: true - name: etcmachineid mountPath: /etc/machine-id readOnly: true - name: dmesg mountPath: /var/log/dmesg readOnly: true