Functions¶
Summary¶
function |
class parent |
truncated documentation |
---|---|---|
if this function is added to the module, the help automation and unit tests call it first before anything goes on … |
||
Checks the library is working. It raises an exception. If you want to disable the logs: |
||
Hashes a set of columns in a dataframe. Keeps the same type. Skips missing values. |
||
Shuffles a dataframe. |
||
One column may contain concatenated values. This function splits these values and multiplies the rows for each split … |
||
Returns a dummy streaming dataframe mostly for unit test purposes. |
||
Enumerates items from a JSON file or string. |
||
Flattens a dictionary with nested structure to a dictionary with no hierarchy. |
||
Hashes a float into a float. |
||
Hashes an integer into an integer. |
||
Hashes a string. |
||
Returns the list of numpy available types. |
||
Replaces the nan values for something not nan. Mostly used by |
||
Does a groupby including keeping missing values (nan). |
||
Reads a dataframe from a zip file. It can be saved by |
||
Randomly splits a dataframe into smaller pieces. The function returns streams of file names. The function relies … |
||
Randomly splits a dataframe into smaller pieces. The function returns streams of file names. The function relies … |
||
Saves a Dataframe into a zip file. It can be read by |
||
This split is for a specific case where data is linked in one way. Let’s assume we have two ids as we have for online … |
||
This split is for a specific case where data is linked in many ways. Let’s assume we have three ids as we have for … |
||
Splits a database in train/test given, every row can have a different weight. |