What are Attributes?

In machine learning, attributes are the data objects that are utilized.

  • Predictors, variables, and fields are all terms used to describe attributes.

Attributes are the predictors that influence a particular result in predictive models. Attributes are the pieces of data that are evaluated for natural groups or connections in descriptive models. A table of employee data, for example, can include features like job title, date of hiring, pay, age, gender, and so on.

Model and input attributes

The columns used to develop, test, or evaluate a model are known as data attributes.

Attributes can be the same in the data and the model. A column labeled SIZE, for example, with the values L, M, and S is an attribute utilized by an algorithm to generate a model. The attribute SIZE is almost certainly the same as the data attribute.

A column SALES PROD, on the other hand, does not relate to a model property because it contains sales numbers for a set of goods. The data property can be SALES PROD, but the model attribute is each product and its related sales figure.

A disparity between model and data attributes is also caused by transformations. A transformation, for example, can perform a computation on data attributes and save the result in a completely new attribute. This newly formed attribute is a model that doesn’t have a data counterpart. Normalization, outlier treatment, and binning are examples of modifications that cause the model to deviate from the data attribute.


Understand the different forms of target data and what a target implies in machine learning.

A supervised model’s goal is a certain type of attribute. The historical values used to train the model are stored in the target column of the training data. The historical values against which the predictions are compared are stored in the target column of the test data. The act of scoring generates a target forecast.

  • A target is not used in clustering, feature extraction, association, or anomaly detection models. Targets cannot be nested columns or columns with unstructured data.

Attributes can be numerical, category, or unstructured.

Theoretically, numerical qualities can have an endless number of values. The values are sorted implicitly, and the discrepancies between them are ordered as well. Categorical attributes are characteristics with values that identify a finite number of discrete categories or classes. The values don’t have any sort of implied order. Binary categoricals have just two potential values, such as yes or no, or male or female. Multi-class categoricals have more than two values, such as small, medium, and large.

Region Attribute

Start with a shortcut option for the simplest configuration and tweak as appropriate. You may also utilize the cache to save your own custom settings for usage by several regions.

Through area shortcuts and region properties, you may automate the administration of data regions and their entries. The location of the data, how the area is handled in memory, dependability behavior, and the automated loading, distribution, and expiration of data entries are all determined by these region configuration options.

  • When feasible, utilize region shortcuts to configure your region, and use region attributes to further modify behavior. The most frequent region configurations are pre-configured in the shortcut settings.