Logstash stands out in log management, due to its speed, ease of use, and integration with complementary search and reporting tools
Elasticsearch, Inc., the commercial firm behind the open source Elasticsearch search engine, released version 1.4 of Logstash last week. Logstash is one of the most popular log management tools available today, though it competes in a crowded space with projects like Scribe, Flume, Chukwa, Fluentd, and Kafka.
The 1.4 release of Logstash contains a number of important improvements, the most obvious being the quicker startup time, now approximately three times faster. The new release maintains the radical emphasis on ease of use, which is a hallmark of the entire ELK (Elasticsearch, Logstash, and Kibana β the last for reporting and visualization) stack.
[ Work smarter, not harder β download the Developersβ Survival Guide from InfoWorld for all the tips and trends programmers need to know. | Keep up with the latest developer news with InfoWorldβs Developer World newsletter. ]
Along with a quicker startup, Logstash 1.4 features an improved installation process. Version 1.4 also includes a simplified plug-in system that makes it even easier for users to customize their Logstash install to specific business needs, as well as redesigned Puppet modules to make it simpler to automate installation and configuration. Youβll also find expanded documentation, with a new and improved get-started guide.
The Logstash legacy
Logstash was born out of Jordan Sisselβs background in devops and system administration, when he found himself constantly dealing with large numbers of log files and needed a centralized mechanism to aggregate and manage them. Logstash was originally conceived without any awareness that Elasticsearch even existed, but as Sissel puts it, βwriting storage systems is boring.β When he discovered Elasticsearch in 2009, it was a perfect fit to store all that log data. Sissel joined Elasticsearch in August 2013.
Over time, Logstash has grown along with the other components of the ELK stack to become part of a comprehensive platform for using log data and helping businesses gain insight into how customers are interacting with e-commerce sites, support systems, and more.
βLogstash can get data from unknown places and from any source and will clean it up, so you donβt have to worry about the exact log types or reconciling different data formats,β says Sissel. βWe handle it all and let you slice and dice that data with Elasticsearch. Serve it up nice and pretty with a side of Kibana, and you have instant feedback on how to better please your customers and drive business success.β
Democratizing business data
Sissel and the Elasticsearch team refer to this as βdemocratizing business data.β ELK is especially good at dealing with βany data with an element of time associated with it,β but itβs not limited to log data. Almost any type of data is ultimately a candidate to be stored, analyzed, and visualized using ELK.
Of course, the idea of βdemocratizing access to dataβ raises issues related to security and access control. Elasticsearch currently does not have a native access-control facility, although itβs on the road map. As Sissel explains, βWe donβt have it yet because security is something you canβt do halfway, so we want to make sure itβs very good before launching it.β
In the meantime, Elasticsearch recommends implementing access control at the HTTP level using HTTP proxies and firewalls. Hereβs a documented example of this configuration.
The rest of the world
While ELK is a powerful stack, itβs not meant to be the be-all and end-all. As such, the creators have taken care to provide interoperability with the rest of the world. Logstash currently bundles output connectors for 60 or more different systems. The range of possible outputs includes such diverse possibilities as AWS S3 buckets, IRC, Solr, MongoDB, Redis, Riak, XMPP, and many more.
Sissel points out that Logstash can be used as part of a more complex analytics workflow, such as complex event processing with Esper, Storm, or S4 β or even batch processing with Hadoop. While Logstash does not include an HDFS output connector today, Sissel says it may arrive in the future, βif we see community demand for it.β
Another case where Logstash is more appropriately used as a complement to other tools is the βdocument ingestionβ scenario. Logstash really is an event/log based system, and itβd be an awkward fit for trying to crawl and consume a document repository and load those documents into Elasticsearch. In such a scenario, a cleaner solution involves using ManifoldCF or Nutch to handle βdocumentβ data, with Logstash as a peer to handle event/log-oriented data.
Open source and support
Logstash is fully open source and licensed under the business-friendly Apache License Version 2.0 (ALv2). Source code is available at GitHub. Downloads of both Logstash and the rest of the ELK Stack components are available at elasticsearch.org.
Organizations can receive support from the engineers that built Logstash and the ELK stack by subscribing to annual support offerings from Elasticsearch, Inc. ELK Stack subscriptions also include free licenses to Marvel, a real-time monitoring system for ELK deployments.
Elasticsearch, Inc. has seen revenue growth of over 400 percent year over year and reached nearly 6 million downloads. Given the companyβs track record over the past few years β and the history of the founders as contributors to projects like Logstash and Apache Lucene β itβs fair to expect a steady stream of innovative new products from Elasticsearch in the future.
This article, βWhatβs new in Logstash and why you should care,β was originally published at InfoWorld.com. Keep up on the latest news in application development and read more of Andrew Oliverβs Strategic Developer blog at InfoWorld.com. For the latest business technology news, follow InfoWorld.com on Twitter.


