Introduction: An intelligent operation and maintenance platform helps enterprises run smoothly, intelligently and efficiently.
Behind every serious accident there are 29 minor accidents, 300 near misses and 1,000 latent hazards. (Heinrich’s law)
With the arrival of the cloud computing era, a large number of enterprises are gradually migrating their business to the cloud. The elasticity of cloud computing makes IT resources easier to purchase and scale. Many enterprises no longer have to spend large amounts of labor, time and capital on purchasing, expanding and upgrading IDC facilities, servers, network cards and other physical resources.
However, as the information age advances and more business moves online, business systems serve ever larger numbers of customers, and system stability becomes more important. If operation and maintenance personnel cannot give early warning of a fault and locate the problem as quickly as possible, the business is easily affected and the losses can be huge.
Faced with huge business systems, numerous service modules, massive volumes of log and monitoring data, and demanding business requirements, the ability to build a fast, automated, full-cycle intelligent operation and maintenance and early warning system has become part of an enterprise’s competitiveness.
Therefore, more and more enterprises are building their own intelligent operation and maintenance platforms to reduce the burden on operation and maintenance personnel, give smarter fault warnings and respond faster. In this era of business on the cloud in particular, enterprise operation and maintenance platforms keep being upgraded to deliver more value: many enterprises have moved from purchasing and maintaining basic resources to driving business value.
The use and analysis of log data in the broadest sense (all machine-generated data) is becoming part of enterprise competitiveness. A research report shows that, as the volume and scale of this machine data grow rapidly, enterprises need more intelligent operation and maintenance platforms to inform business decisions. In addition, the time value density of business systems is gradually increasing, the number of customers each system serves is growing exponentially, and both the complexity of business systems and the scale of clusters keep growing. A stable, efficient and affordable intelligent operation and maintenance platform has therefore gradually become the foundation on which an enterprise stands.
However, building an intelligent operation and maintenance platform often raises the following challenges:
1. Massive logs and fast alerting: with hundreds of billions or even trillions of logs, how can they be queried and analyzed in real time? How can stable real-time writes be guaranteed?
2. Complex systems: the data has dozens of dimensions. How can multidimensional analysis be done more efficiently?
3. Finding what matters: when there are tens of thousands of error logs, how can the important information be found?
4. Different time ranges: how can the different analysis modes and storage requirements of real-time data and historical data both be met?
In recent years, Alibaba Cloud has run into many such problems while serving the Alibaba economy and Alibaba Cloud customers. After continuous refinement, Alibaba Cloud launched its Log Service (SLS) to help customers build intelligent operation and maintenance platforms.
At the Yunqi (Apsara) Conference on September 18, Huajian, a senior intelligent product expert at Alibaba Cloud, gave a talk entitled “Log service for the intelligent operation and maintenance platform in the cloud era, helping enterprises innovate and iterate”. In the talk, he explained in detail how Log Service (SLS) provides users with one-stop log collection, alerting, storage, analysis and visualization. In the cloud era, technical operation and maintenance personnel can build their own intelligent operation and maintenance platform on SLS, quickly analyze system status, gain insight into the business, and support the enterprise’s rapid iteration and business innovation.
Alibaba Cloud Log Service (SLS) has the following advantages:
1. Real-time analysis and alerting within seconds: out of 100 billion logs, 1 billion records are returned within seconds.
2. Joint analysis of multidimensional data: high-dimensional queries, real-time analysis, scheduled tasks and visualization.
3. Deeper insight into details: second-level detail, AI-based detection of anomalous points, and data clustering help surface the important information.
These capabilities help customers collect, store and analyze logging, metric and tracing data in a unified way, meet the requirements of business monitoring, log analysis and security auditing, and address the challenges of fast, multidimensional and in-depth analysis.
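To make "data clustering" and "detection of anomalous points" more concrete, here is a minimal Python sketch of the general ideas. It is not how SLS implements them: the masking rules, the function names and the z-score threshold are all assumptions chosen for illustration. Clustering is approximated by masking variable tokens so that similar lines collapse into one template, and anomaly detection by flagging values that lie far from the mean.

```python
import re
from collections import Counter
from statistics import mean, pstdev

# Hypothetical masking rules (an assumption, not an SLS feature):
# variable tokens are replaced so that similar log lines share one template.
_MASKS = [
    (re.compile(r"\b\d{1,3}(?:\.\d{1,3}){3}\b"), "<IP>"),
    (re.compile(r"\b0x[0-9a-fA-F]+\b"), "<HEX>"),
    (re.compile(r"\d+"), "<NUM>"),
]


def to_template(line):
    """Replace IP addresses, hex values and numbers with placeholder tokens."""
    for pattern, token in _MASKS:
        line = pattern.sub(token, line)
    return line


def cluster_logs(lines):
    """Group lines by template; return (template, count) pairs, most frequent first."""
    return Counter(to_template(line) for line in lines).most_common()


def anomalies(counts, threshold=3.0):
    """Return the indexes whose value is more than `threshold` standard
    deviations away from the mean (a plain z-score test)."""
    mu = mean(counts)
    sigma = pstdev(counts)
    if sigma == 0:
        return []  # all values identical: nothing stands out
    return [i for i, c in enumerate(counts) if abs(c - mu) / sigma > threshold]
```

Passing tens of thousands of error lines to `cluster_logs` reduces them to a short list of templates ranked by frequency, and passing per-minute error counts to `anomalies` returns the minutes worth a closer look. A z-score needs enough data points to be meaningful, so a production system would likely use a more robust method.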
At the same time, SLS provides full-cycle data transfer capabilities. The recently released data processing and data delivery features support different analysis modes and time-range requirements. With data processing, enterprises can normalize and ETL their data to suit different analysis requirements. With data delivery, enterprises can move data to wherever it is needed in each period of its life cycle.
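The two steps described above, normalizing raw data and then moving it according to its age, can be sketched in Python as follows. This is an illustration only and does not use the SLS API: the log line layout (`timestamp level service message`), the destination names `realtime_store` and `archive_store`, and the seven-day window are all hypothetical.

```python
from datetime import datetime, timedelta, timezone


def parse_line(line):
    """Normalize a raw line of the assumed form
    'ISO-8601-timestamp LEVEL service message...' into a structured record."""
    ts, level, service, message = line.split(" ", 3)
    return {
        "time": datetime.fromisoformat(ts),
        "level": level.upper(),
        "service": service,
        "message": message.strip(),
    }


def route(record, now=None, hot_window=timedelta(days=7)):
    """Choose a destination by age: recent records stay in the store used for
    real-time queries, older records go to cheaper archive storage."""
    now = now or datetime.now(timezone.utc)
    if now - record["time"] <= hot_window:
        return "realtime_store"  # hypothetical destination name
    return "archive_store"  # hypothetical destination name
```

For example, `route(parse_line("2024-05-09T08:30:00+00:00 error payment upstream timeout"))` parses the line into a record with an upper-cased level and then picks a destination based on how old the record is. Timestamps must carry a UTC offset so that they can be compared with the timezone-aware current time.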
In this era of business on the cloud, therefore, we need a more intelligent operation and maintenance platform to help our business run smoothly, intelligently and efficiently. We firmly believe that an intelligent operation and maintenance platform built on Alibaba Cloud Log Service (SLS) can help enterprises innovate and iterate on business value, and help their customers’ business grow more stably and rapidly.
This article is original content from Alibaba Cloud and may not be reproduced without permission.