NTT WEST Group is a company committed to solving social and regional issues through telecommunications, ICT services, and solutions.
AsiaQuest, together with NTT Business Solutions, Inc. (a group company of NTT WEST; hereinafter referred to as “NTT Business Solutions”), consistently provides operational improvement, optimization, and SRE technical support for the AWS environment that underpins NTT WEST’s business chat system “elgana”. Through this support, infrastructure operation, monitoring, and security have been strengthened, enabling stable system operation and continuous improvement.
※ SRE (Site Reliability Engineering)
A methodology for ensuring the stable operation of IT systems.
By leveraging monitoring and automation to prevent failures and enable rapid recovery, SRE enhances service reliability while improving operational efficiency.
“elgana”, provided by NTT WEST, is a business-focused chat service.
It is a highly reliable communication platform with an extensive deployment track record. Customers across a wide range of industries and business types use it, and the public sector relies on it as a mission-critical communication tool in emergencies.
In 2022, the service was rehosted from another cloud provider to AWS. Since then, modernization efforts have continued, further evolving the system as a foundation for business communication.
After the migration, not only was stable operation required, but rapid responses to challenges related to non-functional requirements also became necessary. To achieve this, the support of a partner with advanced AWS expertise was indispensable.
NTT WEST therefore engaged AsiaQuest, which has a proven track record in AWS co-creation support, to improve and optimize the operation of the “elgana” system.
AsiaQuest supports the customer’s SRE team by working alongside them as a member of the team. Together, we propose and implement improvements to daily operational issues, striving for stable system operation and continuous enhancement.
Investigated the current AWS environment and proposed improvement plans leveraging AWS services.
Conducted regular meetings several times per week during the initial phase, and weekly after migration, to address issues arising from configuration changes.
Improved SIEM on OpenSearch by refining design, selecting relevant data, securing necessary specifications, and configuring access rights for production use.
Investigated causes of cost increases due to changes in usage and implemented several cost optimization measures, achieving significant cost savings.
Examples of Operational Support and Improvement Proposals
The system configuration of the elgana environment, along with the proposed improvements, is outlined below.
As the log analysis platform, SIEM on OpenSearch was adopted because it could be built quickly and cost-effectively while covering the required features.
In the elgana environment, several hundred gigabytes of logs are generated daily. Importing all logs into OpenSearch would not have been cost-effective or realistic. Therefore, through a proof of concept (PoC), we supported the customer by designing index rotation intervals and shard numbers, reviewing instance specifications, selecting which logs should be ingested for analysis, and creating dashboards.
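The interplay between rotation interval and shard count can be illustrated with a rough sizing calculation. This is a minimal sketch, not the design produced in the PoC: the 30 GB target shard size, the 10% indexing overhead, and the 50 GB/day ingest volume are all assumptions chosen for the example.

```python
import math


def primary_shards(daily_ingest_gb: float,
                   rotation_days: int,
                   target_shard_gb: float = 30.0,
                   index_overhead: float = 0.10) -> int:
    """Estimate primary shards for one index holding `rotation_days` of logs.

    All default values are illustrative assumptions, not figures from the PoC.
    """
    # Approximate on-disk size of one index: raw logs for the rotation
    # period plus an allowance for indexing overhead.
    index_size_gb = daily_ingest_gb * rotation_days * (1 + index_overhead)
    # Round up so the average shard stays at or below the target size,
    # and always keep at least one shard.
    return max(1, math.ceil(index_size_gb / target_shard_gb))


# Hypothetical comparison: 50 GB/day of selected logs, daily vs weekly rotation.
daily = primary_shards(50, rotation_days=1)
weekly = primary_shards(50, rotation_days=7)
print(f"daily rotation: {daily} shards per index, weekly rotation: {weekly}")
```

A calculation like this also shows why selecting which logs to ingest matters: the shard count, and with it the required instance capacity, grows in step with the ingested volume.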
For security alert notifications, the customer’s security requirements called for detecting unauthorized or abnormal activity. To address this, we proposed a mechanism that sends notifications for login events and threat events, enabling the detection of potential security risks.
Furthermore, for security incident investigations, the customer needed to analyze VPC Flow Logs (particularly abnormal outbound communications) whenever GuardDuty detects a threat event. OpenSearch was considered, but the large log volume and high visualization costs posed challenges. We therefore proposed Amazon Detective as a first step, since it is easy to configure and provides effective visualization capabilities.
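One common way to build a notification mechanism for login and threat events on AWS is with Amazon EventBridge rules. The sketch below only shows what such event patterns might look like. It assumes that "threat events" are GuardDuty findings and that "login events" are AWS Management Console sign-ins. Neither the routing mechanism nor the severity threshold is confirmed by the project, so treat both as illustrative.

```python
import json

# Assumption: threat events are GuardDuty findings delivered via EventBridge.
# The severity floor of 7 (where GuardDuty's "High" band starts) is illustrative.
THREAT_PATTERN = {
    "source": ["aws.guardduty"],
    "detail-type": ["GuardDuty Finding"],
    "detail": {"severity": [{"numeric": [">=", 7]}]},
}

# Assumption: login events are console sign-ins recorded by CloudTrail.
LOGIN_PATTERN = {
    "source": ["aws.signin"],
    "detail-type": ["AWS Console Sign In via CloudTrail"],
}


def to_rule_json(pattern: dict) -> str:
    """Serialize a pattern into the JSON string an EventBridge rule takes."""
    return json.dumps(pattern, indent=2)


print(to_rule_json(THREAT_PATTERN))
```

Each rule would then point at a notification target such as an SNS topic. That wiring is left out here because it depends on the customer's environment.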
To support cost optimization, we configured notifications in AWS Budgets and Cost Anomaly Detection, enabling early detection of anomalies.
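As an illustration of a budget notification, the sketch below builds a request shaped for the AWS Budgets `CreateBudget` API. The account ID, budget name, monthly limit, 80% threshold, and e-mail address are hypothetical placeholders, not the settings used for elgana.

```python
def budget_request(account_id: str, name: str, monthly_limit_usd: float,
                   threshold_pct: float, email: str) -> dict:
    """Build keyword arguments shaped for the AWS Budgets CreateBudget API."""
    return {
        "AccountId": account_id,
        "Budget": {
            "BudgetName": name,
            # The API expects the amount as a string.
            "BudgetLimit": {"Amount": f"{monthly_limit_usd:.2f}", "Unit": "USD"},
            "TimeUnit": "MONTHLY",
            "BudgetType": "COST",
        },
        "NotificationsWithSubscribers": [{
            # Notify when actual spend exceeds the given percentage of the limit.
            "Notification": {
                "NotificationType": "ACTUAL",
                "ComparisonOperator": "GREATER_THAN",
                "Threshold": threshold_pct,
                "ThresholdType": "PERCENTAGE",
            },
            "Subscribers": [{"SubscriptionType": "EMAIL", "Address": email}],
        }],
    }


# Hypothetical values for illustration only.
request = budget_request("123456789012", "example-monthly-cost",
                         1000.0, 80.0, "sre-team@example.com")
print(request["Budget"]["BudgetLimit"])
```

With boto3 installed and credentials configured, a dictionary like this could be passed as `boto3.client("budgets").create_budget(**request)`. Cost Anomaly Detection is configured separately through monitors and subscriptions and is not covered by this sketch.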
So far, we have identified cost increases in services such as NAT Gateway, S3, CloudTrail, Data Firehose, AWS Config, and CloudFront, investigated their causes, and proposed corresponding countermeasures.
In 2025, we took over SRE operations previously handled by another vendor, and we have since focused on ensuring stable system operation and driving continuous improvement.
The main responsibilities of the SRE team include:
In the elgana project, in order to continuously deliver a truly “cutting-edge and innovative elgana” to customers, we plan to accelerate modernization while steadily advancing architectural improvements.
As a co-creation partner supporting the ever-changing and evolving elgana, AsiaQuest will continue engaging in close dialogue with the customer, working together to solve challenges and further enhance service quality.
In the elgana project, our top priority was rapid development and release. As a result, we faced the following challenges in building and operating the service infrastructure:
1. Because we focused heavily on implementation, our monitoring and cost-optimization mechanisms needed to be strengthened.
2. As we accelerated the release cycle, an increasing amount of work became dependent on specific individuals.
Given this situation, AsiaQuest provided practical proposals tailored to our specific needs, and we first began to see tangible results in cost optimization and the enhancement of our monitoring framework.
Furthermore, by consolidating both the build and operational processes under AsiaQuest, we achieved greater standardization through codification and improved visibility. This eliminated dependency on individual expertise and gave us more stable and reliable infrastructure than before.
We strongly feel that AsiaQuest is not merely a service provider, but a true partner that works alongside us to build and evolve the elgana service platform.