EVALUATION OF THE EFFICIENCY OF THE RECURSIVE DATA SET DISTRIBUTION PROCESS USING THE CART ALGORITHM
DOI:
https://doi.org/10.31891/2307-5732-2023-323-4-25-35
Keywords:
algorithm, Classification and Regression Tree, Gini index, Receiver Operating Characteristic, Area Under the Curve
Abstract
The paper presents research results and compares them with the results of foreign and domestic works. The comparison shows that the CART model is highly efficient at predicting the effectiveness of advertising campaigns, which agrees with the conclusions of other researchers and confirms the advantages and stability of the algorithm in the context of evaluating advertising campaigns. A procedure for collecting, processing, and analyzing data for the CART method is given. The node-splitting process, which continues until a given number of nodes or a given tree depth is reached, is considered. The evaluation function used for node splitting is based on the Gini index, which estimates the impurity of a node: the lower the impurity a candidate split yields, the more that split is preferred. A model for evaluating the effectiveness of advertising campaigns using the CART algorithm has been developed. A method for checking the accuracy of the developed model is given, and the model's results are compared with real data. The use of GridSearchCV to search over tree depths from 1 to 10 is analyzed, with the F1 score as the evaluation metric. The cv parameter specifies the number of folds used in cross-validation. The novelty of the study is the use of the CART algorithm to evaluate the effectiveness of advertising campaigns. The analyzed method makes it possible to process large volumes of data quickly and accurately and to determine the factors that most affect the effectiveness of advertising campaigns. The practical value of the research is that the developed algorithm supports rational use of the marketing budget and optimization of advertising campaigns in order to achieve the best results.
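The split criterion and the depth search described in the abstract can be illustrated with a minimal sketch. It assumes scikit-learn and NumPy are available. The synthetic data set, the helper names gini_impurity and weighted_split_impurity, the choice of cv=5, and the train/test split are illustrative assumptions rather than the authors' data or code. The Gini impurity of a node is taken here in its standard form, one minus the sum of squared class proportions.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.metrics import f1_score, roc_auc_score
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.tree import DecisionTreeClassifier


def gini_impurity(labels):
    """Gini impurity of a node: 1 - sum over classes of p_k squared."""
    labels = np.asarray(labels)
    if labels.size == 0:
        return 0.0
    _, counts = np.unique(labels, return_counts=True)
    proportions = counts / counts.sum()
    return float(1.0 - np.sum(proportions ** 2))


def weighted_split_impurity(left_labels, right_labels):
    """Size-weighted Gini impurity of the two children of a candidate split."""
    n_left, n_right = len(left_labels), len(right_labels)
    total = n_left + n_right
    return (n_left / total) * gini_impurity(left_labels) + (
        n_right / total
    ) * gini_impurity(right_labels)


# Synthetic stand-in for advertising campaign records (assumption:
# binary target, e.g. "campaign effective" vs "not effective").
X, y = make_classification(
    n_samples=500, n_features=8, n_informative=4, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Search tree depths 1..10, scored by F1, with 5-fold cross-validation.
# The value cv=5 is an assumption; the abstract does not state it.
search = GridSearchCV(
    estimator=DecisionTreeClassifier(criterion="gini", random_state=0),
    param_grid={"max_depth": list(range(1, 11))},
    scoring="f1",
    cv=5,
)
search.fit(X_train, y_train)

best_depth = search.best_params_["max_depth"]
test_f1 = f1_score(y_test, search.predict(X_test))
test_auc = roc_auc_score(y_test, search.predict_proba(X_test)[:, 1])
print(f"best max_depth={best_depth}, F1={test_f1:.3f}, AUC={test_auc:.3f}")
```

In this sketch a perfectly mixed two-class node has impurity 0.5 and a pure node has impurity 0, so a split that separates the classes cleanly has weighted impurity 0 and would be chosen over any other candidate. The held-out F1 and ROC AUC values stand in for the comparison with real data mentioned above. They depend on the synthetic data and say nothing about the results reported in the paper.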