spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Qian Huang (JIRA)" <>
Subject [jira] [Commented] (SPARK-4036) Add Conditional Random Fields (CRF) algorithm to Spark MLlib
Date Tue, 16 Feb 2016 04:47:18 GMT


Qian Huang commented on SPARK-4036:

Hi, I have created a spark package,
I co-work on it with hujiayin.

[~josephkb] It can be said that this package is a spark-based re-implementation of crf++.
It has the same limit as crf++ has, like feature generator design and only for "segmenting/labeling
sequential data". But it basically meets the requirement of NLP and can run in parallel for
big data.

Welcome to try it. If you encounter bugs, feel free to submit an issue or pull request.

> Add Conditional Random Fields (CRF) algorithm to Spark MLlib
> ------------------------------------------------------------
>                 Key: SPARK-4036
>                 URL:
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: Guoqiang Li
>            Assignee: Kai Sasaki
>         Attachments: CRF_design.1.pdf,, dig-hair-eye-train.model,,
sample-input, sample-output
> Conditional random fields (CRFs) are a class of statistical modelling method often applied
in pattern recognition and machine learning, where they are used for structured prediction.

> The paper: 

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message