Files
df-research/tech.ml.dataset/docs/index.html
2026-02-08 11:20:43 -10:00

60 lines
54 KiB
HTML
Vendored

<!DOCTYPE html PUBLIC ""
"">
<html><head><meta charset="UTF-8" /><title>TMD 8.003</title><script async="true" src="https://www.googletagmanager.com/gtag/js?id=G-RGTB4J7LGP"></script><script>window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'G-95TVFC1FEB');</script><link rel="stylesheet" type="text/css" href="css/default.css" /><link rel="stylesheet" type="text/css" href="highlight/solarized-light.css" /><script type="text/javascript" src="highlight/highlight.min.js"></script><script type="text/javascript" src="js/jquery.min.js"></script><script type="text/javascript" src="js/page_effects.js"></script><script>hljs.initHighlightingOnLoad();</script></head><body><div id="header"><h2>Generated by <a href="https://github.com/weavejester/codox">Codox</a> with <a href="https://github.com/xsc/codox-theme-rdash">RDash UI</a> theme</h2><h1><a href="index.html"><span class="project-title"><span class="project-name">TMD</span> <span class="project-version">8.003</span></span></a></h1></div><div class="sidebar primary"><h3 class="no-link"><span class="inner">Project</span></h3><ul class="index-link"><li class="depth-1 current"><a href="index.html"><div class="inner">Index</div></a></li></ul><h3 class="no-link"><span class="inner">Topics</span></h3><ul><li class="depth-1 "><a href="000-getting-started.html"><div class="inner"><span>tech.ml.dataset Getting Started</span></div></a></li><li class="depth-1 "><a href="100-walkthrough.html"><div class="inner"><span>tech.ml.dataset Walkthrough</span></div></a></li><li class="depth-1 "><a href="200-quick-reference.html"><div class="inner"><span>tech.ml.dataset Quick Reference</span></div></a></li><li class="depth-1 "><a href="columns-readers-and-datatypes.html"><div class="inner"><span>tech.ml.dataset Columns, Readers, and Datatypes</span></div></a></li><li class="depth-1 "><a href="nippy-serialization-rocks.html"><div class="inner"><span>tech.ml.dataset And nippy</span></div></a></li><li class="depth-1 "><a href="supported-datatypes.html"><div class="inner"><span>tech.ml.dataset Supported Datatypes</span></div></a></li></ul><h3 class="no-link"><span class="inner">Namespaces</span></h3><ul><li class="depth-1"><div class="no-link"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>tech</span></div></div></li><li class="depth-2"><div class="no-link"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>v3</span></div></div></li><li class="depth-3"><a href="tech.v3.dataset.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>dataset</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.categorical.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>categorical</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.clipboard.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>clipboard</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.column.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>column</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.column-filters.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>column-filters</span></div></a></li><li class="depth-4"><div class="no-link"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>io</span></div></div></li><li class="depth-5 branch"><a href="tech.v3.dataset.io.csv.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>csv</span></div></a></li><li class="depth-5 branch"><a href="tech.v3.dataset.io.datetime.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>datetime</span></div></a></li><li class="depth-5 branch"><a href="tech.v3.dataset.io.string-row-parser.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>string-row-parser</span></div></a></li><li class="depth-5"><a href="tech.v3.dataset.io.univocity.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>univocity</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.join.html"><div class="inner"><span class="tree" style="top: -145px;"><span class="top" style="height: 154px;"></span><span class="bottom"></span></span><span>join</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.math.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>math</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.metamorph.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>metamorph</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.modelling.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>modelling</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.print.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>print</span></div></a></li><li class="depth-4"><a href="tech.v3.dataset.reductions.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>reductions</span></div></a></li><li class="depth-5"><a href="tech.v3.dataset.reductions.apache-data-sketch.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>apache-data-sketch</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.rolling.html"><div class="inner"><span class="tree" style="top: -52px;"><span class="top" style="height: 61px;"></span><span class="bottom"></span></span><span>rolling</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.set.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>set</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.dataset.tensor.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>tensor</span></div></a></li><li class="depth-4"><a href="tech.v3.dataset.zip.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>zip</span></div></a></li><li class="depth-3"><div class="no-link"><div class="inner"><span class="tree" style="top: -641px;"><span class="top" style="height: 650px;"></span><span class="bottom"></span></span><span>libs</span></div></div></li><li class="depth-4 branch"><a href="tech.v3.libs.arrow.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>arrow</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.libs.clj-transit.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>clj-transit</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.libs.fastexcel.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>fastexcel</span></div></a></li><li class="depth-4"><div class="no-link"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>guava</span></div></div></li><li class="depth-5"><a href="tech.v3.libs.guava.cache.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>cache</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.libs.parquet.html"><div class="inner"><span class="tree" style="top: -52px;"><span class="top" style="height: 61px;"></span><span class="bottom"></span></span><span>parquet</span></div></a></li><li class="depth-4 branch"><a href="tech.v3.libs.poi.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>poi</span></div></a></li><li class="depth-4"><a href="tech.v3.libs.tribuo.html"><div class="inner"><span class="tree"><span class="top"></span><span class="bottom"></span></span><span>tribuo</span></div></a></li></ul></div><div class="namespace-index" id="content"><h1><span class="project-title"><span class="project-name">TMD</span> <span class="project-version">8.003</span></span></h1><div class="doc"><p>A Clojure high performance data processing system.</p></div><h2>Topics</h2><ul class="topics"><li><a href="000-getting-started.html">tech.ml.dataset Getting Started</a></li><li><a href="100-walkthrough.html">tech.ml.dataset Walkthrough</a></li><li><a href="200-quick-reference.html">tech.ml.dataset Quick Reference</a></li><li><a href="columns-readers-and-datatypes.html">tech.ml.dataset Columns, Readers, and Datatypes</a></li><li><a href="nippy-serialization-rocks.html">tech.ml.dataset And nippy</a></li><li><a href="supported-datatypes.html">tech.ml.dataset Supported Datatypes</a></li></ul><h2>Namespaces</h2><div class="namespace"><h3><a href="tech.v3.dataset.html">tech.v3.dataset</a></h3><div class="doc"><div class="markdown"><p>Column major dataset abstraction for efficiently manipulating
in memory datasets.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.html#var--.3E.3Edataset">-&gt;&gt;dataset</a> </li><li> <a href="tech.v3.dataset.html#var--.3Edataset">-&gt;dataset</a> </li><li> <a href="tech.v3.dataset.html#var-add-column">add-column</a> </li><li> <a href="tech.v3.dataset.html#var-add-or-update-column">add-or-update-column</a> </li><li> <a href="tech.v3.dataset.html#var-all-descriptive-stats-names">all-descriptive-stats-names</a> </li><li> <a href="tech.v3.dataset.html#var-append-columns">append-columns</a> </li><li> <a href="tech.v3.dataset.html#var-assoc-ds">assoc-ds</a> </li><li> <a href="tech.v3.dataset.html#var-assoc-metadata">assoc-metadata</a> </li><li> <a href="tech.v3.dataset.html#var-bind-.3E">bind-&gt;</a> </li><li> <a href="tech.v3.dataset.html#var-brief">brief</a> </li><li> <a href="tech.v3.dataset.html#var-categorical-.3Enumber">categorical-&gt;number</a> </li><li> <a href="tech.v3.dataset.html#var-categorical-.3Eone-hot">categorical-&gt;one-hot</a> </li><li> <a href="tech.v3.dataset.html#var-column">column</a> </li><li> <a href="tech.v3.dataset.html#var-column-.3Edataset">column-&gt;dataset</a> </li><li> <a href="tech.v3.dataset.html#var-column-cast">column-cast</a> </li><li> <a href="tech.v3.dataset.html#var-column-count">column-count</a> </li><li> <a href="tech.v3.dataset.html#var-column-labeled-mapseq">column-labeled-mapseq</a> </li><li> <a href="tech.v3.dataset.html#var-column-map">column-map</a> </li><li> <a href="tech.v3.dataset.html#var-column-map-m">column-map-m</a> </li><li> <a href="tech.v3.dataset.html#var-column-names">column-names</a> </li><li> <a href="tech.v3.dataset.html#var-columns">columns</a> </li><li> <a href="tech.v3.dataset.html#var-columns-with-missing-seq">columns-with-missing-seq</a> </li><li> <a href="tech.v3.dataset.html#var-columnwise-concat">columnwise-concat</a> </li><li> <a href="tech.v3.dataset.html#var-concat">concat</a> </li><li> <a href="tech.v3.dataset.html#var-concat-copying">concat-copying</a> </li><li> <a href="tech.v3.dataset.html#var-concat-inplace">concat-inplace</a> </li><li> <a href="tech.v3.dataset.html#var-data-.3Edataset">data-&gt;dataset</a> </li><li> <a href="tech.v3.dataset.html#var-dataset-.3Edata">dataset-&gt;data</a> </li><li> <a href="tech.v3.dataset.html#var-dataset-name">dataset-name</a> </li><li> <a href="tech.v3.dataset.html#var-dataset-parser">dataset-parser</a> </li><li> <a href="tech.v3.dataset.html#var-dataset.3F">dataset?</a> </li><li> <a href="tech.v3.dataset.html#var-descriptive-stats">descriptive-stats</a> </li><li> <a href="tech.v3.dataset.html#var-drop-columns">drop-columns</a> </li><li> <a href="tech.v3.dataset.html#var-drop-missing">drop-missing</a> </li><li> <a href="tech.v3.dataset.html#var-drop-rows">drop-rows</a> </li><li> <a href="tech.v3.dataset.html#var-empty-column-names">empty-column-names</a> </li><li> <a href="tech.v3.dataset.html#var-empty-dataset">empty-dataset</a> </li><li> <a href="tech.v3.dataset.html#var-ensure-array-backed">ensure-array-backed</a> </li><li> <a href="tech.v3.dataset.html#var-filter">filter</a> </li><li> <a href="tech.v3.dataset.html#var-filter-column">filter-column</a> </li><li> <a href="tech.v3.dataset.html#var-filter-dataset">filter-dataset</a> </li><li> <a href="tech.v3.dataset.html#var-group-by">group-by</a> </li><li> <a href="tech.v3.dataset.html#var-group-by-.3Eindexes">group-by-&gt;indexes</a> </li><li> <a href="tech.v3.dataset.html#var-group-by-column">group-by-column</a> </li><li> <a href="tech.v3.dataset.html#var-group-by-column-.3Eindexes">group-by-column-&gt;indexes</a> </li><li> <a href="tech.v3.dataset.html#var-group-by-column-consumer">group-by-column-consumer</a> </li><li> <a href="tech.v3.dataset.html#var-has-column.3F">has-column?</a> </li><li> <a href="tech.v3.dataset.html#var-head">head</a> </li><li> <a href="tech.v3.dataset.html#var-induction">induction</a> </li><li> <a href="tech.v3.dataset.html#var-major-version">major-version</a> </li><li> <a href="tech.v3.dataset.html#var-mapseq-parser">mapseq-parser</a> </li><li> <a href="tech.v3.dataset.html#var-mapseq-reader">mapseq-reader</a> </li><li> <a href="tech.v3.dataset.html#var-mapseq-rf">mapseq-rf</a> </li><li> <a href="tech.v3.dataset.html#var-min-n-by-column">min-n-by-column</a> </li><li> <a href="tech.v3.dataset.html#var-missing">missing</a> </li><li> <a href="tech.v3.dataset.html#var-new-column">new-column</a> </li><li> <a href="tech.v3.dataset.html#var-new-dataset">new-dataset</a> </li><li> <a href="tech.v3.dataset.html#var-order-column-names">order-column-names</a> </li><li> <a href="tech.v3.dataset.html#var-pmap-ds">pmap-ds</a> </li><li> <a href="tech.v3.dataset.html#var-print-all">print-all</a> </li><li> <a href="tech.v3.dataset.html#var-rand-nth">rand-nth</a> </li><li> <a href="tech.v3.dataset.html#var-remove-column">remove-column</a> </li><li> <a href="tech.v3.dataset.html#var-remove-columns">remove-columns</a> </li><li> <a href="tech.v3.dataset.html#var-remove-empty-columns">remove-empty-columns</a> </li><li> <a href="tech.v3.dataset.html#var-remove-rows">remove-rows</a> </li><li> <a href="tech.v3.dataset.html#var-rename-columns">rename-columns</a> </li><li> <a href="tech.v3.dataset.html#var-replace-missing">replace-missing</a> </li><li> <a href="tech.v3.dataset.html#var-replace-missing-value">replace-missing-value</a> </li><li> <a href="tech.v3.dataset.html#var-reverse-rows">reverse-rows</a> </li><li> <a href="tech.v3.dataset.html#var-row-at">row-at</a> </li><li> <a href="tech.v3.dataset.html#var-row-count">row-count</a> </li><li> <a href="tech.v3.dataset.html#var-row-map">row-map</a> </li><li> <a href="tech.v3.dataset.html#var-row-mapcat">row-mapcat</a> </li><li> <a href="tech.v3.dataset.html#var-rows">rows</a> </li><li> <a href="tech.v3.dataset.html#var-rowvec-at">rowvec-at</a> </li><li> <a href="tech.v3.dataset.html#var-rowvecs">rowvecs</a> </li><li> <a href="tech.v3.dataset.html#var-sample">sample</a> </li><li> <a href="tech.v3.dataset.html#var-select">select</a> </li><li> <a href="tech.v3.dataset.html#var-select-by-index">select-by-index</a> </li><li> <a href="tech.v3.dataset.html#var-select-columns">select-columns</a> </li><li> <a href="tech.v3.dataset.html#var-select-columns-by-index">select-columns-by-index</a> </li><li> <a href="tech.v3.dataset.html#var-select-missing">select-missing</a> </li><li> <a href="tech.v3.dataset.html#var-select-rows">select-rows</a> </li><li> <a href="tech.v3.dataset.html#var-set-dataset-name">set-dataset-name</a> </li><li> <a href="tech.v3.dataset.html#var-shape">shape</a> </li><li> <a href="tech.v3.dataset.html#var-shuffle">shuffle</a> </li><li> <a href="tech.v3.dataset.html#var-sort-by">sort-by</a> </li><li> <a href="tech.v3.dataset.html#var-sort-by-column">sort-by-column</a> </li><li> <a href="tech.v3.dataset.html#var-tail">tail</a> </li><li> <a href="tech.v3.dataset.html#var-take-nth">take-nth</a> </li><li> <a href="tech.v3.dataset.html#var-unique-by">unique-by</a> </li><li> <a href="tech.v3.dataset.html#var-unique-by-column">unique-by-column</a> </li><li> <a href="tech.v3.dataset.html#var-unordered-select">unordered-select</a> </li><li> <a href="tech.v3.dataset.html#var-unroll-column">unroll-column</a> </li><li> <a href="tech.v3.dataset.html#var-update">update</a> </li><li> <a href="tech.v3.dataset.html#var-update-column">update-column</a> </li><li> <a href="tech.v3.dataset.html#var-update-columns">update-columns</a> </li><li> <a href="tech.v3.dataset.html#var-update-columnwise">update-columnwise</a> </li><li> <a href="tech.v3.dataset.html#var-update-elemwise">update-elemwise</a> </li><li> <a href="tech.v3.dataset.html#var-value-reader">value-reader</a> </li><li> <a href="tech.v3.dataset.html#var-write.21">write!</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.categorical.html">tech.v3.dataset.categorical</a></h3><div class="doc"><div class="markdown"><p>Conversions of categorical values into numbers and back. Two forms of conversions
are supported, a straight value-&gt;integer map and one-hot encoding.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.categorical.html#var-dataset-.3Ecategorical-maps">dataset-&gt;categorical-maps</a> </li><li> <a href="tech.v3.dataset.categorical.html#var-fit-categorical-map">fit-categorical-map</a> </li><li> <a href="tech.v3.dataset.categorical.html#var-fit-one-hot">fit-one-hot</a> </li><li> <a href="tech.v3.dataset.categorical.html#var-invert-categorical-map">invert-categorical-map</a> </li><li> <a href="tech.v3.dataset.categorical.html#var-invert-one-hot-map">invert-one-hot-map</a> </li><li> <a href="tech.v3.dataset.categorical.html#var-reverse-map-categorical-xforms">reverse-map-categorical-xforms</a> </li><li> <a href="tech.v3.dataset.categorical.html#var-transform-categorical-map">transform-categorical-map</a> </li><li> <a href="tech.v3.dataset.categorical.html#var-transform-one-hot">transform-one-hot</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.clipboard.html">tech.v3.dataset.clipboard</a></h3><div class="doc"><div class="markdown"><p>Optional namespace that copies a dataset to the clipboard for pasting into
applications such as excel or google sheets.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.clipboard.html#var-clipboard">clipboard</a> </li><li> <a href="tech.v3.dataset.clipboard.html#var-clipboard-.3Edataset">clipboard-&gt;dataset</a> </li><li> <a href="tech.v3.dataset.clipboard.html#var-dataset-.3Eclipboard">dataset-&gt;clipboard</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.column.html">tech.v3.dataset.column</a></h3><div class="doc"><div class="markdown"></div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.column.html#var-clone">clone</a> </li><li> <a href="tech.v3.dataset.column.html#var-column-map">column-map</a> </li><li> <a href="tech.v3.dataset.column.html#var-column-name">column-name</a> </li><li> <a href="tech.v3.dataset.column.html#var-correlation">correlation</a> </li><li> <a href="tech.v3.dataset.column.html#var-extend-column-with-empty">extend-column-with-empty</a> </li><li> <a href="tech.v3.dataset.column.html#var-intersect-missing-sets">intersect-missing-sets</a> </li><li> <a href="tech.v3.dataset.column.html#var-is-column.3F">is-column?</a> </li><li> <a href="tech.v3.dataset.column.html#var-is-missing.3F">is-missing?</a> </li><li> <a href="tech.v3.dataset.column.html#var-missing">missing</a> </li><li> <a href="tech.v3.dataset.column.html#var-new-column">new-column</a> </li><li> <a href="tech.v3.dataset.column.html#var-parse-column">parse-column</a> </li><li> <a href="tech.v3.dataset.column.html#var-prepend-column-with-empty">prepend-column-with-empty</a> </li><li> <a href="tech.v3.dataset.column.html#var-select">select</a> </li><li> <a href="tech.v3.dataset.column.html#var-set-missing">set-missing</a> </li><li> <a href="tech.v3.dataset.column.html#var-set-name">set-name</a> </li><li> <a href="tech.v3.dataset.column.html#var-stats">stats</a> </li><li> <a href="tech.v3.dataset.column.html#var-string-table-keyset">string-table-keyset</a> </li><li> <a href="tech.v3.dataset.column.html#var-supported-stats">supported-stats</a> </li><li> <a href="tech.v3.dataset.column.html#var-to-double-array">to-double-array</a> </li><li> <a href="tech.v3.dataset.column.html#var-union-missing-sets">union-missing-sets</a> </li><li> <a href="tech.v3.dataset.column.html#var-unique">unique</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.column-filters.html">tech.v3.dataset.column-filters</a></h3><div class="doc"><div class="markdown"><p>Queries to select column subsets that have various properites such as all numeric
columns, all feature columns, or columns that have a specific datatype.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.column-filters.html#var-boolean">boolean</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-categorical">categorical</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-column-filter">column-filter</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-datetime">datetime</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-difference">difference</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-feature">feature</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-intersection">intersection</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-metadata-filter">metadata-filter</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-missing">missing</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-no-missing">no-missing</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-numeric">numeric</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-of-datatype">of-datatype</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-prediction">prediction</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-probability-distribution">probability-distribution</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-string">string</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-target">target</a> </li><li> <a href="tech.v3.dataset.column-filters.html#var-union">union</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.io.csv.html">tech.v3.dataset.io.csv</a></h3><div class="doc"><div class="markdown"><p>CSV parsing based on <a href="https://cnuernber.github.io/charred/">charred.api/read-csv</a>.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.io.csv.html#var-csv-.3Edataset">csv-&gt;dataset</a> </li><li> <a href="tech.v3.dataset.io.csv.html#var-csv-.3Edataset-seq">csv-&gt;dataset-seq</a> </li><li> <a href="tech.v3.dataset.io.csv.html#var-rows-.3Ecsv.21">rows-&gt;csv!</a> </li><li> <a href="tech.v3.dataset.io.csv.html#var-rows-.3Edataset-fn">rows-&gt;dataset-fn</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.io.datetime.html">tech.v3.dataset.io.datetime</a></h3><div class="doc"><div class="markdown"><p>Helpful and well tested string-&gt;datetime pathways.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.io.datetime.html#var-datatype-.3Egeneral-parse-fn-map">datatype-&gt;general-parse-fn-map</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-datetime-formatter-or-str-.3Eparser-fn">datetime-formatter-or-str-&gt;parser-fn</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-datetime-formatter-parse-str-fn">datetime-formatter-parse-str-fn</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-local-date-formatter">local-date-formatter</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-local-date-parser-patterns">local-date-parser-patterns</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-local-time-formatter">local-time-formatter</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-parse-duration">parse-duration</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-parse-local-date">parse-local-date</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-parse-local-date-time">parse-local-date-time</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-parse-local-time">parse-local-time</a> </li><li> <a href="tech.v3.dataset.io.datetime.html#var-time-parser-patterns">time-parser-patterns</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.io.string-row-parser.html">tech.v3.dataset.io.string-row-parser</a></h3><div class="doc"><div class="markdown"><p>Parsing functions based on raw data that is represented by a sequence
of string arrays.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.io.string-row-parser.html#var-partition-all-rows">partition-all-rows</a> </li><li> <a href="tech.v3.dataset.io.string-row-parser.html#var-rows-.3Edataset">rows-&gt;dataset</a> </li><li> <a href="tech.v3.dataset.io.string-row-parser.html#var-sample-rows">sample-rows</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.io.univocity.html">tech.v3.dataset.io.univocity</a></h3><div class="doc"><div class="markdown"><p>Bindings to univocity. Transforms csv's, tsv's into sequences
of string arrays that are then passed into <code>tech.v3.dataset.io.string-row-parser</code>
methods.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.io.univocity.html#var-create-csv-parser">create-csv-parser</a> </li><li> <a href="tech.v3.dataset.io.univocity.html#var-csv-.3Edataset">csv-&gt;dataset</a> </li><li> <a href="tech.v3.dataset.io.univocity.html#var-csv-.3Erows">csv-&gt;rows</a> </li><li> <a href="tech.v3.dataset.io.univocity.html#var-PApplyWriteOptions">PApplyWriteOptions</a> </li><li> <a href="tech.v3.dataset.io.univocity.html#var-raw-row-iterable">raw-row-iterable</a> </li><li> <a href="tech.v3.dataset.io.univocity.html#var-rows-.3Ecsv.21">rows-&gt;csv!</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.join.html">tech.v3.dataset.join</a></h3><div class="doc"><div class="markdown"><p>implementation of join algorithms, both exact (hash-join) and near.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.join.html#var-hash-join">hash-join</a> </li><li> <a href="tech.v3.dataset.join.html#var-inner-join">inner-join</a> </li><li> <a href="tech.v3.dataset.join.html#var-left-join">left-join</a> </li><li> <a href="tech.v3.dataset.join.html#var-left-join-asof">left-join-asof</a> </li><li> <a href="tech.v3.dataset.join.html#var-pd-merge">pd-merge</a> </li><li> <a href="tech.v3.dataset.join.html#var-right-join">right-join</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.math.html">tech.v3.dataset.math</a></h3><div class="doc"><div class="markdown"><p>Various mathematic transformations of datasets such as (inefficiently)
building simple tables, pca, and normalizing columns to have mean of 0 and variance of 1.
More in-depth transformations are found at <code>tech.v3.dataset.neanderthal</code>.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.math.html#var-correlation-table">correlation-table</a> </li><li> <a href="tech.v3.dataset.math.html#var-fill-range-replace">fill-range-replace</a> </li><li> <a href="tech.v3.dataset.math.html#var-fit-minmax">fit-minmax</a> </li><li> <a href="tech.v3.dataset.math.html#var-fit-std-scale">fit-std-scale</a> </li><li> <a href="tech.v3.dataset.math.html#var-interpolate-loess">interpolate-loess</a> </li><li> <a href="tech.v3.dataset.math.html#var-transform-minmax">transform-minmax</a> </li><li> <a href="tech.v3.dataset.math.html#var-transform-std-scale">transform-std-scale</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.metamorph.html">tech.v3.dataset.metamorph</a></h3><div class="doc"><div class="markdown"><p>This is an auto-generated api system - it scans the namespaces and changes the first
to be metamorph-compliant which means transforming an argument that is just a dataset into
an argument that is a metamorph context - a map of <code>{:metamorph/data ds}</code>. They also return
their result as a metamorph context.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.metamorph.html#var-add-column">add-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-add-or-update-column">add-or-update-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-append-columns">append-columns</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-assoc-ds">assoc-ds</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-assoc-metadata">assoc-metadata</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-brief">brief</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-build-pipelined-function">build-pipelined-function</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-categorical-.3Enumber">categorical-&gt;number</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-categorical-.3Eone-hot">categorical-&gt;one-hot</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-column">column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-column-.3Edataset">column-&gt;dataset</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-column-cast">column-cast</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-column-count">column-count</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-column-labeled-mapseq">column-labeled-mapseq</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-column-map">column-map</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-column-names">column-names</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-column-values-.3Ecategorical">column-values-&gt;categorical</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-columns">columns</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-columns-with-missing-seq">columns-with-missing-seq</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-columnwise-concat">columnwise-concat</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-concat">concat</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-concat-copying">concat-copying</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-concat-inplace">concat-inplace</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-data-.3Edataset">data-&gt;dataset</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-dataset-.3Ecategorical-xforms">dataset-&gt;categorical-xforms</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-dataset-.3Edata">dataset-&gt;data</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-dataset-name">dataset-name</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-dataset.3F">dataset?</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-descriptive-stats">descriptive-stats</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-drop-columns">drop-columns</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-drop-missing">drop-missing</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-drop-rows">drop-rows</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-empty-column-names">empty-column-names</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-empty-dataset">empty-dataset</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-ensure-array-backed">ensure-array-backed</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-feature-ecount">feature-ecount</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-filter">filter</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-filter-column">filter-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-filter-dataset">filter-dataset</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-group-by">group-by</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-group-by-.3Eindexes">group-by-&gt;indexes</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-group-by-column">group-by-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-group-by-column-.3Eindexes">group-by-column-&gt;indexes</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-group-by-column-consumer">group-by-column-consumer</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-has-column.3F">has-column?</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-head">head</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-induction">induction</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-inference-column.3F">inference-column?</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-inference-target-column-names">inference-target-column-names</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-inference-target-ds">inference-target-ds</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-inference-target-label-inverse-map">inference-target-label-inverse-map</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-inference-target-label-map">inference-target-label-map</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-k-fold-datasets">k-fold-datasets</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-labels">labels</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-mapseq-reader">mapseq-reader</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-min-n-by-column">min-n-by-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-missing">missing</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-model-type">model-type</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-new-column">new-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-new-dataset">new-dataset</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-num-inference-classes">num-inference-classes</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-order-column-names">order-column-names</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-pmap-ds">pmap-ds</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-print-all">print-all</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-probability-distributions-.3Elabel-column">probability-distributions-&gt;label-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-rand-nth">rand-nth</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-remove-column">remove-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-remove-columns">remove-columns</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-remove-empty-columns">remove-empty-columns</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-remove-rows">remove-rows</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-rename-columns">rename-columns</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-replace-missing">replace-missing</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-replace-missing-value">replace-missing-value</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-reverse-rows">reverse-rows</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-row-at">row-at</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-row-count">row-count</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-row-map">row-map</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-row-mapcat">row-mapcat</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-rows">rows</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-rowvec-at">rowvec-at</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-rowvecs">rowvecs</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-sample">sample</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-select">select</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-select-by-index">select-by-index</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-select-columns">select-columns</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-select-columns-by-index">select-columns-by-index</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-select-missing">select-missing</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-select-rows">select-rows</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-set-dataset-name">set-dataset-name</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-set-inference-target">set-inference-target</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-shape">shape</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-shuffle">shuffle</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-sort-by">sort-by</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-sort-by-column">sort-by-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-tail">tail</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-take-nth">take-nth</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-train-test-split">train-test-split</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-unique-by">unique-by</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-unique-by-column">unique-by-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-unordered-select">unordered-select</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-unroll-column">unroll-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-update">update</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-update-column">update-column</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-update-columns">update-columns</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-update-columnwise">update-columnwise</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-update-elemwise">update-elemwise</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-value-reader">value-reader</a> </li><li> <a href="tech.v3.dataset.metamorph.html#var-write.21">write!</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.modelling.html">tech.v3.dataset.modelling</a></h3><div class="doc"><div class="markdown"><p>Methods related specifically to machine learning such as setting the inference
target. This file integrates tightly with tech.v3.dataset.categorical which provides
categorical -&gt; number and one-hot transformation pathways.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.modelling.html#var-column-values-.3Ecategorical">column-values-&gt;categorical</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-dataset-.3Ecategorical-xforms">dataset-&gt;categorical-xforms</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-feature-ecount">feature-ecount</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-inference-column.3F">inference-column?</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-inference-target-column-names">inference-target-column-names</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-inference-target-ds">inference-target-ds</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-inference-target-label-inverse-map">inference-target-label-inverse-map</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-inference-target-label-map">inference-target-label-map</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-k-fold-datasets">k-fold-datasets</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-labels">labels</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-model-type">model-type</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-num-inference-classes">num-inference-classes</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-probability-distributions-.3Elabel-column">probability-distributions-&gt;label-column</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-set-inference-target">set-inference-target</a> </li><li> <a href="tech.v3.dataset.modelling.html#var-train-test-split">train-test-split</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.print.html">tech.v3.dataset.print</a></h3><div class="doc"><div class="markdown"></div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.print.html#var-dataset-.3Estr">dataset-&gt;str</a> </li><li> <a href="tech.v3.dataset.print.html#var-dataset-data-.3Estr">dataset-data-&gt;str</a> </li><li> <a href="tech.v3.dataset.print.html#var-print-policy">print-policy</a> </li><li> <a href="tech.v3.dataset.print.html#var-print-range">print-range</a> </li><li> <a href="tech.v3.dataset.print.html#var-print-types">print-types</a> </li><li> <a href="tech.v3.dataset.print.html#var-print-width">print-width</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.reductions.html">tech.v3.dataset.reductions</a></h3><div class="doc"><div class="markdown"><p>Specific high performance reductions intended to be performend over a sequence
of datasets. This allows aggregations to be done in situations where the dataset is
larger than what will fit in memory on a normal machine. Due to this fact, summation
is implemented using Kahan algorithm and various statistical methods are done in using
statistical estimation techniques and thus are prefixed with <code>prob-</code> which is short
for <code>probabilistic</code>.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.reductions.html#var-aggregate">aggregate</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-count-distinct">count-distinct</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-distinct">distinct</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-distinct-int32">distinct-int32</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-first-value">first-value</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-group-by-column-agg">group-by-column-agg</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-group-by-column-agg-rf">group-by-column-agg-rf</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-maximum">maximum</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-maximum-rf">maximum-rf</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-mean">mean</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-prob-cdf">prob-cdf</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-prob-interquartile-range">prob-interquartile-range</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-prob-median">prob-median</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-prob-quantile">prob-quantile</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-prob-set-cardinality">prob-set-cardinality</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-reducer">reducer</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-reducer-.3Ecolumn-reducer">reducer-&gt;column-reducer</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-reservoir-dataset">reservoir-dataset</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-reservoir-desc-stat">reservoir-desc-stat</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-row-count">row-count</a> </li><li> <a href="tech.v3.dataset.reductions.html#var-sum">sum</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.reductions.apache-data-sketch.html">tech.v3.dataset.reductions.apache-data-sketch</a></h3><div class="doc"><div class="markdown"><p>Reduction reducers based on the apache data sketch family of algorithms.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-doubles-sketch-reducer">doubles-sketch-reducer</a> </li><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-hll-reducer">hll-reducer</a> </li><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-prob-cdf">prob-cdf</a> </li><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-prob-cdfs">prob-cdfs</a> </li><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-prob-interquartile-range">prob-interquartile-range</a> </li><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-prob-median">prob-median</a> </li><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-prob-pmfs">prob-pmfs</a> </li><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-prob-quantile">prob-quantile</a> </li><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-prob-quantiles">prob-quantiles</a> </li><li> <a href="tech.v3.dataset.reductions.apache-data-sketch.html#var-prob-set-cardinality">prob-set-cardinality</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.rolling.html">tech.v3.dataset.rolling</a></h3><div class="doc"><div class="markdown"><p>Implement a generalized rolling window including support for time-based variable
width windows.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.rolling.html#var-expanding">expanding</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-first">first</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-last">last</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-max">max</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-mean">mean</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-min">min</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-nth">nth</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-rolling">rolling</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-standard-deviation">standard-deviation</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-sum">sum</a> </li><li> <a href="tech.v3.dataset.rolling.html#var-variance">variance</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.set.html">tech.v3.dataset.set</a></h3><div class="doc"><div class="markdown"><p>Extensions to datasets to do per-row bag-semantics set/union and intersection.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.set.html#var-difference">difference</a> </li><li> <a href="tech.v3.dataset.set.html#var-intersection">intersection</a> </li><li> <a href="tech.v3.dataset.set.html#var-reduce-intersection">reduce-intersection</a> </li><li> <a href="tech.v3.dataset.set.html#var-reduce-union">reduce-union</a> </li><li> <a href="tech.v3.dataset.set.html#var-union">union</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.tensor.html">tech.v3.dataset.tensor</a></h3><div class="doc"><div class="markdown"><p>Conversion mechanisms from dataset to tensor and back.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.tensor.html#var-dataset-.3Etensor">dataset-&gt;tensor</a> </li><li> <a href="tech.v3.dataset.tensor.html#var-mean-center-columns.21">mean-center-columns!</a> </li><li> <a href="tech.v3.dataset.tensor.html#var-tensor-.3Edataset">tensor-&gt;dataset</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.dataset.zip.html">tech.v3.dataset.zip</a></h3><div class="doc"><div class="markdown"><p>Load zip data. Zip files with a single file entry can be loaded with -&gt;dataset. When
a zip file has multiple entries you have to call zipfile-&gt;dataset-seq.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.dataset.zip.html#var-dataset-seq-.3Ezipfile.21">dataset-seq-&gt;zipfile!</a> </li><li> <a href="tech.v3.dataset.zip.html#var-zipfile-.3Edataset-seq">zipfile-&gt;dataset-seq</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.libs.arrow.html">tech.v3.libs.arrow</a></h3><div class="doc"><div class="markdown"><p>Support for reading/writing apache arrow datasets. Datasets may be memory mapped
but default to being read via an input stream.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.libs.arrow.html#var-col-.3Ebuffers">col-&gt;buffers</a> </li><li> <a href="tech.v3.libs.arrow.html#var-construct-column">construct-column</a> </li><li> <a href="tech.v3.libs.arrow.html#var-dataset-.3Estream.21">dataset-&gt;stream!</a> </li><li> <a href="tech.v3.libs.arrow.html#var-dataset-seq-.3Estream.21">dataset-seq-&gt;stream!</a> </li><li> <a href="tech.v3.libs.arrow.html#var-decimal-column-metadata">decimal-column-metadata</a> </li><li> <a href="tech.v3.libs.arrow.html#var-stream-.3Edataset">stream-&gt;dataset</a> </li><li> <a href="tech.v3.libs.arrow.html#var-stream-.3Edataset-iterable">stream-&gt;dataset-iterable</a> </li><li> <a href="tech.v3.libs.arrow.html#var-validity-.3Eindexes">validity-&gt;indexes</a> </li><li> <a href="tech.v3.libs.arrow.html#var-validity-.3Emissing">validity-&gt;missing</a> </li><li> <a href="tech.v3.libs.arrow.html#var-validity-info">validity-info</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.libs.clj-transit.html">tech.v3.libs.clj-transit</a></h3><div class="doc"><div class="markdown"><p>Transit bindings for the jvm version of tech.v3.dataset.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.libs.clj-transit.html#var-dataset-.3Etransit">dataset-&gt;transit</a> </li><li> <a href="tech.v3.libs.clj-transit.html#var-dataset-.3Etransit-str">dataset-&gt;transit-str</a> </li><li> <a href="tech.v3.libs.clj-transit.html#var-java-time-read-handlers">java-time-read-handlers</a> </li><li> <a href="tech.v3.libs.clj-transit.html#var-java-time-write-handlers">java-time-write-handlers</a> </li><li> <a href="tech.v3.libs.clj-transit.html#var-read-handlers">read-handlers</a> </li><li> <a href="tech.v3.libs.clj-transit.html#var-transit-.3Edataset">transit-&gt;dataset</a> </li><li> <a href="tech.v3.libs.clj-transit.html#var-transit-str-.3Edataset">transit-str-&gt;dataset</a> </li><li> <a href="tech.v3.libs.clj-transit.html#var-write-handlers">write-handlers</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.libs.fastexcel.html">tech.v3.libs.fastexcel</a></h3><div class="doc"><div class="markdown"><p>Parse a dataset in xlsx format. This namespace auto-registers a handler for
the 'xlsx' file type so that when using -&gt;dataset, <code>xlsx</code> will automatically map to
<code>(first (workbook-&gt;datasets))</code>.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.libs.fastexcel.html#var-input-.3Eworkbook">input-&gt;workbook</a> </li><li> <a href="tech.v3.libs.fastexcel.html#var-workbook-.3Edatasets">workbook-&gt;datasets</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.libs.guava.cache.html">tech.v3.libs.guava.cache</a></h3><div class="doc"><div class="markdown"><p>Use a google guava cache to memoize function results. Function must not return
nil values. Exceptions propagate to caller.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.libs.guava.cache.html#var-memo-stats">memo-stats</a> </li><li> <a href="tech.v3.libs.guava.cache.html#var-memoize">memoize</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.libs.parquet.html">tech.v3.libs.parquet</a></h3><div class="doc"><div class="markdown"><p>Support for reading Parquet files. You must require this namespace to
enable parquet read/write support.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.libs.parquet.html#var--.3Erow-group-supplier">-&gt;row-group-supplier</a> </li><li> <a href="tech.v3.libs.parquet.html#var-ds-.3Eparquet">ds-&gt;parquet</a> </li><li> <a href="tech.v3.libs.parquet.html#var-ds-seq-.3Eparquet">ds-seq-&gt;parquet</a> </li><li> <a href="tech.v3.libs.parquet.html#var-parquet-.3Eds">parquet-&gt;ds</a> </li><li> <a href="tech.v3.libs.parquet.html#var-parquet-.3Eds-seq">parquet-&gt;ds-seq</a> </li><li> <a href="tech.v3.libs.parquet.html#var-parquet-.3Emetadata-seq">parquet-&gt;metadata-seq</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.libs.poi.html">tech.v3.libs.poi</a></h3><div class="doc"><div class="markdown"><p>Parse a dataset in xls or xlsx format. This namespace auto-registers a handler for
the <code>xls</code> file type so that when using -&gt;dataset, <code>xls</code> will automatically map to
<code>(first (workbook-&gt;datasets))</code>.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.libs.poi.html#var-input-.3Eworkbook">input-&gt;workbook</a> </li><li> <a href="tech.v3.libs.poi.html#var-workbook-.3Edatasets">workbook-&gt;datasets</a> </li></ul></div></div><div class="namespace"><h3><a href="tech.v3.libs.tribuo.html">tech.v3.libs.tribuo</a></h3><div class="doc"><div class="markdown"><p>Bindings to make working with tribuo more straight forward when using datasets.</p>
</div></div><div class="index"><p>Public variables and functions:</p><ul><li> <a href="tech.v3.libs.tribuo.html#var-classification-predictions-.3Edataset">classification-predictions-&gt;dataset</a> </li><li> <a href="tech.v3.libs.tribuo.html#var-evaluate-regression">evaluate-regression</a> </li><li> <a href="tech.v3.libs.tribuo.html#var-make-classification-datasource">make-classification-datasource</a> </li><li> <a href="tech.v3.libs.tribuo.html#var-make-regression-datasource">make-regression-datasource</a> </li><li> <a href="tech.v3.libs.tribuo.html#var-predict-classification">predict-classification</a> </li><li> <a href="tech.v3.libs.tribuo.html#var-predict-regression">predict-regression</a> </li><li> <a href="tech.v3.libs.tribuo.html#var-train-classification">train-classification</a> </li><li> <a href="tech.v3.libs.tribuo.html#var-train-regression">train-regression</a> </li><li> <a href="tech.v3.libs.tribuo.html#var-trainer">trainer</a> </li></ul></div></div></div></body></html>