Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1651–1700 of 1706 papers

Title	Date	Tasks	Status
Improved Sentence-Level Arabic Dialect Classification	Aug 1, 2014	ClassificationFeature Engineering	—Unverified
An Error Analysis Tool for Natural Language Processing and Applied Machine Learning	Aug 1, 2014	BIG-bench Machine LearningFeature Engineering	—Unverified
Feature Engineering for Knowledge Base Construction	Jul 24, 2014	Feature EngineeringKnowledge Base Construction	—Unverified
Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network	Jun 15, 2014	Feature EngineeringInformation Retrieval	—Unverified
Robust Domain Adaptation for Relation Extraction via Clustering Consistency	Jun 1, 2014	ClusteringDomain Adaptation	—Unverified
Bayesian Kernel Methods for Natural Language Processing	Jun 1, 2014	Feature EngineeringMachine Translation	—Unverified
Max-Margin Tensor Neural Network for Chinese Word Segmentation	Jun 1, 2014	Chinese Word SegmentationFeature Engineering	—Unverified
Word-Based Dialog State Tracking with Recurrent Neural Networks	Jun 1, 2014	dialog state trackingFeature Engineering	—Unverified
Linguistic Structured Sparsity in Text Categorization	Jun 1, 2014	Feature EngineeringLanguage Modelling	—Unverified
Improving Citation Polarity Classification with Product Reviews	Jun 1, 2014	ClassificationDomain Adaptation	—Unverified
Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification	Jun 1, 2014	ClassificationFeature Engineering	CodeCode Available
How to Use less Features and Reach Better Performance in Author Gender Identification	May 1, 2014	Author ProfilingDimensionality Reduction	—Unverified
Regularized Structured Perceptron: A Case Study on Chinese Word Segmentation, POS Tagging and Parsing	Apr 1, 2014	Chinese Word SegmentationDependency Parsing	—Unverified
Special Techniques for Constituent Parsing of Morphologically Rich Languages	Apr 1, 2014	Dependency ParsingFeature Engineering	—Unverified
Locally Non-Linear Learning for Statistical Machine Translation via Discretization and Structured Regularization	Jan 1, 2014	Feature EngineeringLanguage Modelling	—Unverified
Combination of Diverse Ranking Models for Personalized Expedia Hotel Searches	Nov 29, 2013	BIG-bench Machine LearningFeature Engineering	—Unverified
Automatic Feature Engineering for Answer Selection and Extraction	Oct 1, 2013	Answer SelectionFeature Engineering	—Unverified
Detection of Product Comparisons - How Far Does an Out-of-the-Box Semantic Role Labeling System Take You?	Oct 1, 2013	Feature EngineeringSemantic Role Labeling	—Unverified
Elephant: Sequence Labeling for Word and Sentence Segmentation	Oct 1, 2013	Boundary DetectionFeature Engineering	—Unverified
Exploring Representations from Unlabeled Data with Co-training for Chinese Word Segmentation	Oct 1, 2013	Chinese Word SegmentationFeature Engineering	—Unverified
Deep Learning for Chinese Word Segmentation and POS Tagging	Oct 1, 2013	Chinese Word SegmentationDeep Learning	—Unverified
A Feature Induction Algorithm with Application to Named Entity Disambiguation	Sep 1, 2013	Entity DisambiguationFeature Engineering	—Unverified
Dual Training and Dual Prediction for Polarity Classification	Aug 1, 2013	ClassificationFeature Engineering	—Unverified
LFG-based Features for Noun Number and Article Grammatical Errors	Aug 1, 2013	Feature EngineeringGrammatical Error Correction	—Unverified
Learning Adaptable Patterns for Passage Reranking	Aug 1, 2013	Feature EngineeringPassage Reranking	—Unverified
Reducing Annotation Effort for Quality Estimation via Active Learning	Aug 1, 2013	Active LearningFeature Engineering	—Unverified
Co-regularizing character-based and word-based models for semi-supervised Chinese word segmentation	Aug 1, 2013	Chinese Word SegmentationFeature Engineering	—Unverified
Parsing with Compositional Vector Grammars	Aug 1, 2013	Feature EngineeringRelation Extraction	—Unverified
Learning Semantic Textual Similarity with Structural Representations	Aug 1, 2013	Feature EngineeringNatural Language Inference	—Unverified
The Haves and the Have-Nots: Leveraging Unlabelled Corpora for Sentiment Analysis	Aug 1, 2013	Dependency ParsingFeature Engineering	—Unverified
Additive Neural Networks for Statistical Machine Translation	Aug 1, 2013	Feature EngineeringLanguage Modelling	—Unverified
Learning Non-linear Features for Machine Translation Using Gradient Boosting Machines	Aug 1, 2013	Feature EngineeringLanguage Modelling	—Unverified
Investigation of annotator's behaviour using eye-tracking data	Aug 1, 2013	ChunkingCoreference Resolution	—Unverified
Recurrent Convolutional Neural Networks for Discourse Compositionality	Jun 15, 2013	Dialogue Act ClassificationFeature Engineering	—Unverified
Feature Engineering in the NLI Shared Task 2013: Charles University Submission Report	Jun 1, 2013	Author ProfilingFeature Engineering	—Unverified
UNITOR: Combining Syntactic and Semantic Kernels for Twitter Sentiment Analysis	Jun 1, 2013	Feature EngineeringSentiment Analysis	—Unverified
UNITOR-HMM-TK: Structured Kernel-based learning for Spatial Role Labeling	Jun 1, 2013	Feature EngineeringRelation Classification	—Unverified
SZTE-NLP: Sentiment Detection on Twitter Messages	Jun 1, 2013	Feature EngineeringSentiment Analysis	—Unverified
WBI-NER: The impact of domain-specific features on the performance of identifying and classifying mentions of drugs	Jun 1, 2013	Feature EngineeringNamed Entity Recognition (NER)	—Unverified
Sentiment Analysis of Political Tweets: Towards an Accurate Classifier	Jun 1, 2013	Feature EngineeringSentiment Analysis	—Unverified
GU-MLT-LT: Sentiment Analysis of Short Messages using Linguistic Features and Stochastic Gradient Descent	Jun 1, 2013	Feature EngineeringOpinion Mining	—Unverified
Role of Morpho-Syntactic Features in Estonian Proficiency Classification	Jun 1, 2013	ClassificationFeature Engineering	—Unverified
ASVUniOfLeipzig: Sentiment Analysis in Twitter using Data-driven Machine Learning Techniques	Jun 1, 2013	BIG-bench Machine LearningFeature Engineering	—Unverified
Improved Temporal Relation Classification using Dependency Parses and Selective Crowdsourced Annotations	Dec 1, 2012	Dependency ParsingFeature Engineering	—Unverified
FeatureForge: A Novel Tool for Visually Supported Feature Engineering and Corpus Revision	Dec 1, 2012	Feature Engineering	—Unverified
Enhancement of Feature Engineering for Conditional Random Field Learning in Chinese Word Segmentation Using Unlabeled Data	Sep 1, 2012	Chinese Word SegmentationFeature Engineering	—Unverified
Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT	Jul 1, 2012	Feature Engineeringfeature selection	—Unverified
Deep Learning for NLP (without Magic)	Jul 1, 2012	Deep LearningFeature Engineering	—Unverified
Building Trainable Taggers in a Web-based, UIMA-Supported NLP Workbench	Jul 1, 2012	ChunkingFeature Engineering	—Unverified
Rep\'erage des entit\'es nomm\'ees pour l'arabe : adaptation non-supervis\'ee et combinaison de syst\`emes (Named Entity Recognition for Arabic : Unsupervised adaptation and Systems combination) [in French]	Jun 1, 2012	Domain AdaptationFeature Engineering	—Unverified

Show:10 25 50

← PrevPage 34 of 35Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN	14 gestures accuracy	0.98	—	Unverified