Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations.

Features

  • Stems
  • Analyzes Morphology
  • Accepts & Transliterates between UTF-8, Windows-1256, ISIRI-3342, HTML-style Numeric Character References, ArabTeX romanization, and Dehdari transliteration
  • Displays Part-of-Speech Tags for Many Words
  • Tokenizes
  • Handles Irregular Verbs, Semi-Regular Verbs, and Many Broken Plurals
  • Very Fast
  • Small Single File, Requiring no External Data

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow Perstem

Perstem Web Site

Other Useful Business Software
Powering the best of the internet | Fastly Icon
Powering the best of the internet | Fastly

Fastly's edge cloud platform delivers faster, safer, and more scalable sites and apps to customers.

Ensure your websites, applications and services can effortlessly handle the demands of your users with Fastly. Fastly’s portfolio is designed to be highly performant, personalized and secure while seamlessly scaling to support your growth.
Try for free
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
1
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

  • Nice, thank you
    1 user found this review helpful.
Read more reviews >

Additional Project Details

Operating Systems

Linux, BSD

Languages

English

Intended Audience

Information Technology, Science/Research, Advanced End Users

User Interface

Web-based, Command-line

Programming Language

Perl

Related Categories

Perl Search Engines, Perl Linguistics Software, Perl Languages Software

Registered

2006-08-23