Package: mlvocab
Title: Vocabulary and Corpus Preprocessing for Natural Language
        Pipelines
Version: 0.0.1
Authors@R: person("Vitalie", "Spinu", email = "spinuvit@gmail.com", role = c("aut", "cre"))
Description: Utilities for preprocessing text corpora into data structures
  suitable for natural language models, such as integer sequences or matrices,
  vocabulary embedding matrices, and term-document, document-term, and term
  co-occurrence matrices. All functions allow full or partial hashing of the
  terms in the vocabulary.
Depends: R (>= 3.4.0)
License: GPL-3
Encoding: UTF-8
Imports: Rcpp (>= 0.12), Matrix, digest (>= 0.6.8), sparsepp (>= 0.2.0)
LinkingTo: Rcpp, digest (>= 0.6.8), sparsepp (>= 0.2.0)
Suggests: testthat, knitr
LazyData: true
SystemRequirements: C++11
BugReports: https://github.com/vspinu/mlvocab/issues
URL: https://github.com/vspinu/mlvocab/
RoxygenNote: 6.0.1
NeedsCompilation: yes
Packaged: 2018-04-12 19:02:49 UTC; vspinu
Author: Vitalie Spinu [aut, cre]
Maintainer: Vitalie Spinu <spinuvit@gmail.com>
Repository: CRAN
Date/Publication: 2018-04-13 08:50:01 UTC
