{ "id": "2007.04785", "version": "v1", "published": "2020-07-09T13:28:49.000Z", "updated": "2020-07-09T13:28:49.000Z", "title": "Neural Architecture Search with GBDT", "authors": [ "Renqian Luo", "Xu Tan", "Rui Wang", "Tao Qin", "Enhong Chen", "Tie-Yan Liu" ], "comment": "Code is available at https://github.com/renqianluo/GBDT-NAS", "categories": [ "cs.LG", "stat.ML" ], "abstract": "Neural architecture search (NAS) with an accuracy predictor that predicts the accuracy of candidate architectures has drawn increasing interests due to its simplicity and effectiveness. Previous works employ neural network based predictors which unfortunately cannot well exploit the tabular data representations of network architectures. As decision tree-based models can better handle tabular data, in this paper, we propose to leverage gradient boosting decision tree (GBDT) as the predictor for NAS and demonstrate that it can improve the prediction accuracy and help to find better architectures than neural network based predictors. Moreover, considering that a better and compact search space can ease the search process, we propose to prune the search space gradually according to important features derived from GBDT using an interpreting tool named SHAP. In this way, NAS can be performed by first pruning the search space (using GBDT as a pruner) and then searching a neural architecture (using GBDT as a predictor), which is more efficient and effective. Experiments on NASBench-101 and ImageNet demonstrate the effectiveness of GBDT for NAS: (1) NAS with GBDT predictor finds top-10 architecture (among all the architectures in the search space) with $0.18\\%$ test regret on NASBench-101, and achieves $24.2\\%$ top-1 error rate on ImageNet; and (2) GBDT based search space pruning and neural architecture search further achieves $23.5\\%$ top-1 error rate on ImageNet.", "revisions": [ { "version": "v1", "updated": "2020-07-09T13:28:49.000Z" } ], "analyses": { "keywords": [ "neural architecture search", "leverage gradient boosting decision tree", "works employ neural network", "error rate", "better handle tabular data" ], "tags": [ "github project" ], "note": { "typesetting": "TeX", "pages": 0, "language": "en", "license": "arXiv", "status": "editable" } } }