(PDF) Machine Learning Toolkit - Splunk · apply Apply a machine learning model that was learned using the fit command. ... Splunk, Splunk>, Listen to Your Data, The Engine for Machine - DOKUMEN.TIPS (2024)

MACHINE LEARNING (ML) | QUICK REFERENCE GUIDE

Algorithm Examples

OneClassSVM ... | fit OneClassSVM * kernel="poly" nu=0.5 coef0=0.5 gamma=0.5 tol=1 degree=3 shrinking=f into TESTMODEL_OneClassSVM

Preprocessing

Preprocessing algorithms prepare data for modeling and can improve prediction accuracy.

Command Description Syntax

fit Fit and apply a machine learning model to search results.

... | fit algorithm y from x params into model_name as output_field

apply Apply a machine learning model that was learned using the fit command.

... | apply model_name as output_field

summary Return a summary of a machine learning model that was learned using the fit command.

| summary model_name

listmodels Return a list of machine learning models that were learned using the fit command.

| listmodels

deletemodel Delete a machine learning model that was learned using the fit command.

| deletemodel model_name

sample Randomly sample or partition events.

... | sample options by split_by_field
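These commands are typically chained into a train, score, inspect, and clean-up workflow. The sketch below reuses the field and model names from this guide's examples (SLA_violation, IO_wait_time, sla_model). The partition count, the seed, the 70/30 split on partition_number, and the output field name predicted_SLA_violation are illustrative assumptions, and each "..." stands for your own base search. Reusing the same seed is assumed to reproduce the same partitions only when both searches return the same events.

```spl
... | sample partitions=10 seed=42 | where partition_number < 7
    | fit LogisticRegression SLA_violation from IO_wait_time into sla_model

... | sample partitions=10 seed=42 | where partition_number >= 7
    | apply sla_model as predicted_SLA_violation

| summary sla_model

| listmodels

| deletemodel sla_model
```

Comparing predicted_SLA_violation with SLA_violation on the held-out partition gives a rough check of the model before it is applied to new data.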

Algorithm Examples

FieldSelector ... | fit FieldSelector type=categorical SLA_violation from *

PCA ... | fit PCA * k=3

KernelPCA ... | fit KernelPCA * k=3 gamma=0.001

TFIDF ... | fit TFIDF Reviews into user_feedback_model max_df=0.6 min_df=0.2

Algorithm Examples

StandardScaler ... | fit StandardScaler *
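Preprocessing and feature extraction steps are usually chained ahead of a model rather than run on their own. A minimal sketch, assuming the Toolkit's default output naming (StandardScaler writes fields prefixed SS_, and PCA writes PC_1, PC_2, and so on); check those prefixes against your installed version before relying on the wildcards:

```spl
... | fit StandardScaler *
    | fit PCA SS_* k=3
    | fit KMeans PC_* k=3
```

Scaling first keeps fields with large numeric ranges from dominating the principal components, and clustering on the reduced fields keeps the KMeans input small.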

Machine Learning Toolkit

Use this document for a quick list of ML search commands as well as some tips on the more widely used algorithms from the Machine Learning Toolkit.

Search Commands for Machine Learning

The Machine Learning Toolkit provides custom search commands for applying machine learning to your data.

Feature Extraction

Feature extraction algorithms transform fields for better prediction accuracy.

Algorithm Examples

KMeans ... | fit KMeans * k=3

DBSCAN ... | fit DBSCAN *

BIRCH ... | fit Birch * k=3

SpectralClustering ... | fit SpectralClustering * k=3

Cluster Numeric

Partition events with multiple numeric fields into clusters.

Forecasting

Forecast future values given past values of a metric (numeric time series).

Algorithm Examples

ARIMA ... | fit ARIMA Voltage order=4-0-1

Anomaly Detection

Find events that contain unusual combinations of values.

FREQUENTLY USED ALGORITHMS

Download Machine Learning Toolkit. Read the Machine Learning Documentation.

© 2017 Splunk Inc. All rights reserved. Splunk, Splunk>, Listen to Your Data, The Engine for Machine Data, Splunk Cloud, Splunk Light and SPL are trademarks and registered trademarks of Splunk Inc. in the United States and other countries. All other brand names, product names, or trademarks belong to their respective owners. GDE-Splunk-MachineLearning-QuickRefGuide-102


docs.splunk.com

Predict Numeric

Predict the value of a numeric field using the values of other fields in that event.

Algorithm Examples

LinearRegression ... | fit LinearRegression temperature from date_month date_hour into temperature_model

Lasso ... | fit Lasso temperature from date_month date_hour

Ridge ... | fit Ridge temperature from date_month date_hour normalize=true alpha=0.5

ElasticNet ... | fit ElasticNet temperature from date_month date_hour normalize=true alpha=0.5

KernelRidge ... | fit KernelRidge temperature from date_month date_hour into temperature_model

SGDRegressor ... | fit SGDRegressor temperature from date_month date_hour into temperature_model

DecisionTreeRegressor ... | fit DecisionTreeRegressor temperature from date_month date_hour into temperature_model

RandomForestRegressor ... | fit RandomForestRegressor temperature from date_month date_hour into temperature_model

Predict Categorical

Predict the value of a categorical field using the values of other fields in that event.

Algorithm Examples

LogisticRegression ... | fit LogisticRegression SLA_violation from IO_wait_time into sla_model

SVM ... | fit SVM SLA_violation from * into sla_model

BernoulliNB ... | fit BernoulliNB type from * into TESTMODEL_BernoulliNB alpha=0.5 binarize=0 fit_prior=f

GaussianNB ... | fit GaussianNB species from * into TESTMODEL_GaussianNB

SGDClassifier ... | fit SGDClassifier SLA_violation from * into sla_model

DecisionTreeClassifier ... | fit DecisionTreeClassifier SLA_violation from * into sla_model

RandomForestClassifier ... | fit RandomForestClassifier SLA_violation from * into sla_model

Predict Numeric Fields (Regression)

LinearRegression, Lasso, Ridge, ElasticNet, KernelRidge, SGDRegressor, DecisionTreeRegressor, RandomForestRegressor

Forecast Numeric Time Series

ARIMA, KalmanFilter (use predict command), ACF (autocorrelation function), PACF (partial autocorrelation function)
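This guide points to the predict command, not fit, for Kalman-filter forecasting. A sketch of that route, reusing the Voltage field from the ARIMA example; the one-hour span, the LLP algorithm choice, the 24-step horizon, and the output name forecast are illustrative assumptions:

```spl
... | timechart span=1h avg(Voltage) as Voltage
    | predict Voltage as forecast algorithm=LLP future_timespan=24
```

The predict command expects evenly spaced time-series input, which is why the events are first aggregated with timechart.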

Cluster Numeric Events

KMeans, DBSCAN, BIRCH, SpectralClustering

Feature Extraction

FieldSelector, PCA, KernelPCA, TFIDF

Preprocessing

StandardScaler

Detect Categorical Outliers

OneClassSVM, anomalydetection (command)

Detect Numeric Outliers

OneClassSVM, streamstats, median, mean, p25, p75
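The streamstats route can be sketched as an interquartile-range check over a sliding window. This is one common way to combine median, p25, and p75, not necessarily the only method the guide intends. The Voltage field is borrowed from the ARIMA example; the window size of 200, the 1.5 multiplier, and the field names IQR, lowerBound, upperBound, and isOutlier are illustrative assumptions:

```spl
... | streamstats window=200 current=true median(Voltage) as median p25(Voltage) as p25 p75(Voltage) as p75
    | eval IQR = p75 - p25
    | eval lowerBound = median - 1.5 * IQR, upperBound = median + 1.5 * IQR
    | eval isOutlier = if(Voltage < lowerBound OR Voltage > upperBound, 1, 0)
```

A larger multiplier flags fewer events. Replacing streamstats with eventstats and dropping the window options computes the bounds over the whole result set instead of a sliding window.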

Predict Categorical Fields (Classification)

LogisticRegression, SVM, BernoulliNB, GaussianNB, SGDClassifier, DecisionTreeClassifier, RandomForestClassifier

[Flowchart: Start; Predict a Field Value; Prepare Data; Group Events; Forecast Future Values; Detect Outliers; branch labels: Numeric, Categorical]


