8000 GitHub - json007/ATFE: Automated Feature Engineering
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

json007/ATFE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

Automated Feature Engineering



💡 Introduction

最新更新:
原本是采用featuretools+gplearn+autoint,后测试gplearn的SymbolicTransformer这种metric用的是Pearson correlation coefficient,它效果不如专门的Regressor或Classifier效果好,然后直接换成了实测效果更好的EvolutionaryForest来实现。

目前的实现主要逻辑是featuretools + EvolutionaryForest + DCN V2.
原代码不维护了。


This project aims to automate feature engineering to extract efficient features directly from raw data without human involvement.

  1. First, new features are extracted from raw data based on different field meanings, such as extracting whether it is a holiday or not based on the date.
  2. Secondly, do meaning correlation of different fields, such as date and consumption, to extract the consumption during the vacation period.
  3. Further, perform logical combination operations on the features generated earlier, such as adding and dividing two columns of features.
  4. Finally, extract the features with algorithms, such as autoint and clustering.

This tool uses the packages featuretools, gplearn, autoint, scikit-learn, xgboost.


This project does not include feature preprocessing; preprocessing should be placed in the model training phase, as different models require different preprocessing methods.



⚡ Usage

# waitting for readthedocs.io

About

Automated Feature Engineering

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0