A little bit of everything, no specific classification
Some esoteric, complex, binary data structure used in advanced ML research projects?
Dataframes, or tables, are everywhere!
Because of their ubiquity, a powerful library to load, manipulate, calculate, transform and export data frames is the core component of any data ecosystem
Long history – not a Kotlin invention
At least 30+ years old
Existing implementations are far from perfect
What can a Kotlin version improve upon?
%>%
notationdf.age
df.filter { age > 18 }
df[age]
df.filter { age() > 18 }
df["age"]
df.filter { "age"<Int>() > 18 }
{ name.upper() }
rather than pandas:
name.str.upper()
.toListOf<MyClass>()
A lot. Almost any kind of data comprehension or manipulation
We already have many tools capable of performing these tasks
So why add something new to this mix?
No database access? No SQL? Get outta here with this trash...
.toDataFrame()
Goal: Build customized marketing campaigns for casino patrons
Project is very much still in beta (alpha?) mode
merge_asof
,
SQL join with >
&
<
,
Excel VLOOKUP
)Speed and data capacity trail other projects significantly
As dataframe approaches beta/release