[카테고리:] Monitoring : Zabbix & ETC
-
Lenovo XCC2 SNMP-Based Power Consumption Monitoring (Zabbix Integration Guide)
In server operations, CPU, memory, and disk health are common monitoring targets—but power consumption is equally critical.A datacenter runs on electricity. Without clear visibility into power usage, it becomes impossible to calculate electricity bills, plan rack capacity, prevent overload, or mitigate failures. Lenovo servers ship with XClarity Controller2 (XCC2), a BMC (Baseboard Management Controller) that…
-
Lenovo XCC2 SNMPv3 기반 전력 사용량 모니터링 (Zabbix 연동 가이드)
서버 운영에서 CPU, 메모리, 디스크 상태 못지않게 중요한 지표가 있다. 바로 전력 사용량(Power Consumption) 이다.데이터센터는 전력으로 움직인다. 전력 소비량을 알지 못하면 전기요금 산정, 랙별 전력 용량 계획, 과부하 방지, 장애 예방이 모두 불가능하다. Lenovo 서버의 XClarity Controller2 (XCC2) 는 BMC(Baseboard Management Controller)로, 원격 관리와 함께 SNMP(Simple Network Management Protocol)를 통해 전력, 온도, 팬 속도 같은…
-
Enterprise Zabbix Extension Pattern: Collecting K8S Pod Network Traffic (via cAdvisor)
By default, Zabbix Kubernetes templates provide only basic metrics such as CPU, memory, disk, node status, and container status. Pod-level network traffic is missing.In many environments, this gap is covered by Prometheus. Here, however, is a case study where Zabbix alone was extended to collect Pod-level RX/TX traffic metrics. This design was built from scratch…
-
엔터프라이즈 Zabbix 확장 패턴: K8S Pod 네트워크 트래픽 수집 (cAdvisor 기반)
일반적으로 Zabbix Kubernetes 템플릿은 CPU, 메모리, 디스크, 노드 상태, 컨테이너 상태 위주의 기본 지표만 제공한다. Pod 단위 네트워크 트래픽은 빠져 있다. 많은 운영 환경에서는 Prometheus를 통해 이 문제를 해결하지만, 여기서는 Zabbix만으로 Pod-level 네트워크 트래픽 수집을 완성한 사례를 공유한다. 이 설계는 레퍼런스가 전혀 없는 상태에서 직접 만든 것으로, 검증된 운영 환경에서 이미 활용 중이다. 설계 개요…
-
An Examination of Monitoring Metrics: Part 5 MongoDB
MongoDB is more than a simple document database. It is widely used as a session store, log analytics engine, and even a messaging backbone.To ensure stable operations, monitoring must cover availability, performance, resources, cursors & connections, and network usage.The following are the core metrics that should always be part of a MongoDB monitoring strategy. 1.…
-
An Examination of Monitoring Metrics: Part 4 Elasticsearch
1. Cluster Health Metrics cluster healthOverall cluster status. unassigned shardsNumber of shards not assigned to any node. 2. Resource Metrics Total size of all file stores / Total available size to JVM in all file stores ⚠️ Problem points when Available decreases rapidly Summary jvm_heap_usage_percent node uptime 3. Performance Metrics query latencySearch query response time.…
-
An Examination of Monitoring Metrics: Part 3 Redis
1. Memory Metrics used_memory mem_fragmentation_ratio evicted_keys 2. Performance Metrics instantaneous_ops_per_sec slowlog 3. Connection Metrics blocked_clients connected_clients rejected_connections 4. Network Metrics total_net_input_bytes / total_net_output_bytes 5. Persistence Metrics (Persistence: the property of data being safely preserved beyond memory to disk) rdb_last_bgsave_status aof_last_bgrewrite_status rdb_changes_since_last_save 6. Cache Efficiency Metrics keyspace_hits / keyspace_misses ⚠ Note: The default Redis template for…
-
An Examination of Monitoring Metrics: Part 2 Kafka
In the previous article, we looked at MySQL metrics. This time, we turn to Kafka.In production environments, Kafka has grown beyond being just a simple message queue to become a critical data streaming platform.Therefore, closely monitoring the state of Kafka brokers and clusters is essential for preventing incidents and ensuring stable performance. In this article,…
-
An Examination of Monitoring Metrics: Part 1 MySQL
In production environments, MySQL is more than just a relational database — it serves as the core data store and the foundation of service stability for countless applications.Therefore, closely monitoring MySQL servers and query performance is essential for preventing failures and ensuring optimal performance. Reading MySQL Performance with Zabbix (Summary) 1) InnoDB: Open Files Meaning:…
-
모니터링 지표에 대한 고찰 : 다섯번째 대상 MongoDB
MongoDB는 단순 문서형 데이터베이스를 넘어 세션 저장소, 로그 분석, 메시징까지 다양한 영역에서 활용된다.안정적 운영을 위해서는 가용성, 성능, 리소스, 커서·연결, 네트워크 영역별 주요 지표를 균형 있게 점검해야 한다.아래는 일반적으로 반드시 살펴야 할 핵심 지표들이다. 1. 가용성 지표 uptime 2. 성능 지표 operation latency (total) operation throughput (rate) 3. 리소스 지표 connections memory usage (resident memory) wiredTiger…