Release 0.3.1
October 20, 2017 ยท View on GitHub
Major Features and Improvements
- We now provide helper methods for creating
serving_input_receiver_fnfor use with tf.estimator. These mirror the existing functions targeting the legacy tf.contrib.learn.estimators-- i.e. for each*_serving_input_fn()in input_fn_maker there is now also a*_serving_input_receiver_fn().
Bug Fixes and Other Changes
- Introduced
tft.apply_vocabthis allows users to separately apply a single vocabulary (as generated bytft.uniques) to several different columns. - Provide a source distribution tar
tensorflow-transform-X.Y.Z.tar.gz.
Breaking changes
- The default prefix for
tft.string_to_intvocab_filenamechanged fromvocab_string_to_inttovocab_string_to_int_uniques. To make your pipelines resilient to implementation details please setvocab_filenameif you are using the generated vocab_filename on a downstream component.
Release 0.3.0
Major Features and Improvements
- Added hash_strings mapper.
- Write vocabularies as asset files instead of constants in the SavedModel.
Bug Fixes and Other Changes
- 'tft.tfidf' now adds 1 to idf values so that terms in every document in the corpus have a non-zero tfidf value.
- Performance and memory usage improvement when running with Beam runners that use multi-threaded workers.
- Performance optimizations in ExampleProtoCoder.
- Depends on
apache-beam[gcp]>=2.1.1,<3. - Depends on
protobuf>=3.3.0<4. - Depends on
six>=1.9,<1.11.
Breaking changes
- Requires pre-installed TensorFlow >= 1.3.
- Removed
tft.mapusetft.apply_functioninstead (as needed). - Removed
tft.tfidf_weightsusetft.tfidfinstead. beam_metadata_io.WriteMetadatanow requires a secondpipelineargument (see examples).- A Beam bug will now affect users who call AnalyzeAndTransformDataset in
certain circumstances. Roughly speaking, if you call
beam.Pipeline()at some point (as all our examples do) you will not experience this bug. The bug is characterized by an error similar toKeyError: (u'AnalyzeAndTransformDataset/AnalyzeDataset/ComputeTensorValues/Extract[Maximum:0]', None)This bug will be fixed in Beam 2.2.
Release 0.1.10
Major Features and Improvements
- Add json-example serving input functions to TF.Transform.
- Add variance analyzer to tf.transform.
Bug Fixes and Other Changes
- Remove duplication in output of
tft.tfidf. - Ensure ngrams output dense_shape is greater than or equal to 0.
- Alters the behavior and interface of tensorflow_transform.mappers.ngrams.
- Depends on
apache-beam[gcp]=>2,<3. - Making TF Parallelism runner-dependent.
- Fixes issue with csv serving input function.
- Various performance and stability improvements.
Deprecations
tft.mapwill be removed on version 0.2.0, see theexamplesdirectory for instructions on how to usetft.apply_functioninstead (as needed).tft.tfidf_weightswill be removed on version 0.2.0, usetft.tfidfinstead.
Release 0.1.9
Major Features and Improvements
- Refactor internals to remove Column and Statistic classes
Bug Fixes and Other Changes
- Remove collections from graph to avoid warnings
- Return float32 from
tfidf_weights - Update tensorflow_transform to use
tf.saved_modelAPIs. - Add default values on example proto coder.
- Various performance and stability improvements.