Release NotesΒΆ

v0.0.3 October 9, 2020
  • Enhancements
    • Implement setitem on DataTable to create/overwrite an existing DataColumn (#165)

    • Add to_pandas method to DataColumn to access the underlying series (#169)

    • Add list_logical_types function and CLI to get dataframe of woodwork LogicalTypes (#172)

    • Add describe method to DataTable to generate statistics for the underlying data (#181)

    • Add optional return_dataframe parameter to load_retail to return either DataFrame or DataTable (#189)

    • Add get_mutual_information method to DataTable to generate mutual information between columns (#203)

    • Add read_csv function to create DataTable directly from CSV file (#222)

  • Fixes
    • Fix bug causing incorrect values for quartiles in DataTable.describe method (#187)

    • Fix bug in DataTable.describe that could cause an error if certain semantic tags were applied improperly (#190)

    • Fix bug with instantiated LogicalTypes breaking when used with issubclass (#231)

  • Changes
    • Remove unnecessary add_standard_tags attribute from DataTable (#171)

    • Remove standard tags from index column and do not return stats for index column from DataTable.describe (#196)

    • Update DataColumn.set_semantic_tags and DataColumn.add_semantic_tags to return new objects (#205)

    • Update various DataTable methods to return new objects rather than modifying in place (#210)

    • Move datetime_format to Datetime LogicalType (#216)

    • Do not calculate mutual info with index column in DataTable.get_mutual_information (#221)

    • Move setting of underlying physical types from DataTable to DataColumn (#233)

  • Documentation Changes
    • Remove unused code from sphinx conf.py, update with Github URL(#160, #163)

    • Update README and docs with new Woodwork logo, with better code snippets (#161, #159)

    • Add DataTable and DataColumn to API Reference (#162)

    • Add docstrings to LogicalType classes (#168)

    • Add Woodwork image to index, clear outputs of Jupyter notebook in docs (#173)

    • Update contributing.md, release.md with all instructions (#176)

    • Add section for setting index and time index to start notebook (#179)

    • Rename changelog to Release Notes (#193)

    • Add section for standard tags to start notebook (#188)

    • Add Understanding Types and Tags user guide (#201)

    • Add missing docstring to list_logical_types (#202)

    • Add Woodwork Global Configuration Options guide (#215)

  • Testing Changes
    • Add tests that confirm dtypes are as expected after DataTable init (#152)

    • Remove unused none_df test fixture (#224)

    • Add test for LogicalType.__str__ method (#225)

Thanks to the following people for contributing to this release: @gsheni, @tamargrey, @thehomebrewnerd

v0.0.2 September 28, 2020
  • Fixes
    • Fix formatting issue when printing global config variables (#138)

  • Changes
    • Change add_standard_tags to use_standard_Tags to better describe behavior (#149)

    • Change access of underlying dataframe to be through to_pandas with ._dataframe field on class (#146)

    • Remove replace_none parameter to DataTables (#146)

  • Documentation Changes
    • Add working code example to README and create Using Woodwork page (#103)

Thanks to the following people for contributing to this release: @gsheni, @tamargrey, @thehomebrewnerd

v0.1.0 September 24, 2020
  • Add natural_language_threshold global config option used for Categorical/NaturalLanguage type inference (#135)

  • Add global config options and add datetime_format option for type inference (#134)

  • Fix bug with Integer and WholeNumber inference in column with pd.NA values (#133)

  • Add DataTable.ltypes property to return series of logical types (#131)

  • Add ability to create new datatable from specified columns with dt[[columns]] (#127)

  • Handle setting and tagging of index and time index columns (#125)

  • Add combined tag and ltype selection (#124)

  • Add changelog, and update changelog check to CI (#123)

  • Implement reset_semantic_tags (#118)

  • Implement DataTable getitem (#119)

  • Add remove_semantic_tags method (#117)

  • Add semantic tag selection (#106)

  • Add github action, rename to woodwork (#113)

  • Add license to setup.py (#112)

  • Reset semantic tags on logical type change (#107)

  • Add standard numeric and category tags (#100)

  • Change semantic_types to semantic_tags, a set of strings (#100)

  • Update dataframe dtypes based on logical types (#94)

  • Add select_logical_types to DataTable (#96)

  • Add pygments to dev-requirements.txt (#97)

  • Add replacing None with np.nan in DataTable init (#87)

  • Refactor DataColumn to make semantic_types and logical_type private (#86)

  • Add pandas_dtype to each Logical Type, and remove dtype attribute on DataColumn (#85)

  • Add set_semantic_types methods on both DataTable and DataColumn (#75)

  • Support passing camel case or snake case strings for setting logical types (#74)

  • Improve flexibility when setting semantic types (#72)

  • Add Whole Number Inference of Logical Types (#66)

  • Add dtypes property to DataTables and repr for DataColumn (#61)

  • Allow specification of semantic types during DataTable creation (#69)

  • Implements set_logical_types on DataTable (#65)

  • Add init files to tests to fix code coverage (#60)

  • Add AutoAssign bot (#59)

  • Add logical types validation in DataTables (#49)

  • Fix working_directory in CI (#57)

  • Add infer_logical_types for DataColumn (#45)

  • Fix ReadME library name, and code coverage badge (#56, #56)

  • Add code coverage (#51)

  • Improve and refactor the validation checks during initialization of a DataTable (#40)

  • Add dataframe attribute to DataTable (#39)

  • Update ReadME with minor usage details (#37)

  • Add License (#34)

  • Rename from datatables to data_tables (#4)

  • Add Logical Types, DataTable, DataColumn (#3)

  • Add Makefile, setup.py, requirements.txt (#2)

  • Initial Release (#1)

Thanks to the following people for contributing to this release: @gsheni, @tamargrey, @thehomebrewnerd