
    Release Notes

    • Supports Parquet, CSV, and TSV formats for $out to S3.
    • Adds a rolling limit for cursors.
    • Improves error messages for commands that cannot be parsed.
    • Supports defaultFormat for files at publicly accessible URLs in HTTP stores.
    • Limits the number of simultaneous queries to 30 per Data Lake.
    • Supports bzip2 compression format.
    • Supports comment option for the aggregate command.
    • Includes various performance and stability improvements.
    • Supports killOp command for terminating a long-running query.
    • Adds configuration for maximum number of wildcard collections for S3 data stores.
    • Improves $out to S3 write performance.
    • Includes general performance and stability improvements.
    • Adds correlationID to the $currentOp output.
    • Includes general performance and stability improvements.
    • Relaxes $out S3 region requirement.
    • Includes improved storage configuration error messages.
    • Includes general performance and stability improvements.
    • Supports $collStats aggregation pipeline stage.
    • Includes performance optimizations for ORC files.
    • Includes general performance and stability improvements.
    • Adds support for the skip and limit fields to the count() command.
    • Adds storageValidateConfig command to validate your Data Lake storage configuration.
    • Includes bug fixes and performance improvements.
    • Includes general performance and stability improvements.
    • Automates storage configuration generation for newly created Atlas Data Lakes.
    • Allows writing partition-aware data to S3 using $out in Atlas Data Lake.
    • Fixes stability issues.
    • Improves performance.
    • Supports the $currentOp stage so that you can monitor query progress on long-running queries.
    • Updates the isodate attribute to accept additional formats.
    • Refreshes the metadata catalog when you use Storage Configuration commands.
    • Includes various performance and stability improvements.
    • Supports filename field references for $out.
    • Supports $toString in $out to S3.
    • Supports optionally granting Atlas Data Lake write access to S3 buckets, enabling use of $out semantics to write directly to those buckets.
    • Adds incremental store, database, collection, and view commands for storage configuration management.
    • Limits collections returned for wildcard collections to 1,000.
    • Updates the storage configuration format.
    • Supports cross-database $lookup queries.
    • Supports lowercase and uppercase file extensions.
    • Template segments now support dot-separated attribute names that correspond to nested fields.
    • Allows the defaultFormat to be specified without a leading dot.
    • Supports filtering based on stripes for files in ORC format.
    • Allows query attributes to be extracted after the first stage.
    • Includes several performance and stability improvements.
    • Supports partition definitions for the following types:

      • epoch_secs, which is seconds since the Unix Epoch
      • epoch_millis, which is milliseconds since the Unix Epoch
      • UUID, which is binary subtype 4
    • Includes several performance and stability improvements.
    • Adds support for reading Apache ORC files.
    • Supports filtering partitions by Parquet file row group statistics.
    • Supports ObjectIds in the path when specifying partition databases.<database>.<collection>.[n].definition.
    • Returns an error if a query produces a document larger than 16 MiB.
    • The $indexStats stage now produces an empty list of indexes instead of an error.
    • Supports JSON as a storage format for $out to S3.
    • $match now implicitly treats all terms as conjunctions.
    • No longer parses empty files.
    • Fixes an issue that caused the {$match: {$expr: {$and: []}}} expression to terminate the connection.
    • Allows nested fields in partition definitions.
    • No longer enumerates directories on S3 when a single subdirectory containing all the partitions matching the query is identified.
    • Fixes an issue where the new storage configuration did not appear on the issuing connection after running setStorageConfig.
    • Adds support for the getLastError database command.
    • Fixes a bug with how union types are handled in Avro.
    • Supports $out aggregation pipeline stage to S3.
    • listIndexes now always returns an empty list.
    • Translates dot-delimited CSV and TSV keys into subdocuments.
    • Storage configuration error message now includes a link to the documentation.
    • Supports the XLSX file format.
    • Includes the correlation ID in query execution error messages.
    • Returns an error to the client when the cursor storage limit is reached.
    • Returns an error to the client on the last getMore if the cursor storage limit is exceeded.
    • Returns the first batch of cursor results more quickly.
    • Improves performance of $lookup when combined with $unwind.
    • Automatically supports SCRAM-SHA-1 credentials without requiring drivers to specify this authentication mechanism.
    • Provides a descriptive error message when the file format is unknown.
    • Provides additional validation on setStorageConfig.
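
One entry above notes that dot-delimited CSV and TSV keys are translated into subdocuments. The sketch below illustrates that idea in plain Python. It is not Data Lake's implementation: the function name `nest_dotted_keys` and the sample row are hypothetical, and it assumes no key is both a scalar and a prefix of another key.

```python
def nest_dotted_keys(flat_row):
    """Turn {"a.b": 1} into {"a": {"b": 1}} (illustrative only)."""
    nested = {}
    for dotted_key, value in flat_row.items():
        parts = dotted_key.split(".")
        target = nested
        for part in parts[:-1]:
            # Create intermediate subdocuments as needed.
            target = target.setdefault(part, {})
        target[parts[-1]] = value
    return nested


# Hypothetical CSV row, already parsed into header -> value pairs.
row = {"name": "Ada", "address.city": "London", "address.zip": "N1"}
print(nest_dotted_keys(row))
```

Here the two `address.*` columns end up together under a single `address` subdocument, while `name` stays a top-level field.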

    Initial public beta release of Atlas Data Lake.
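
Several entries above concern $out to S3 (output formats, filename field references, and the relaxed region requirement). The following is a minimal sketch of how such a stage might be assembled as a plain Python dictionary before it is passed to a driver. The helper `build_out_to_s3_stage`, the bucket and filename values, and the `status` field are hypothetical, and the `bucket`, `region`, `filename`, and `format.name` keys are assumed to match the $out documentation for your release.

```python
import json


def build_out_to_s3_stage(bucket, filename, file_format="parquet", region=None):
    """Build a $out-to-S3 stage as a dict (illustrative sketch only)."""
    s3 = {
        "bucket": bucket,
        "filename": filename,
        "format": {"name": file_format},
    }
    # Assumption: region may be omitted, per the relaxed region requirement.
    if region is not None:
        s3["region"] = region
    return {"$out": {"s3": s3}}


# Hypothetical pipeline: filter documents, then write the results to S3.
pipeline = [
    {"$match": {"status": "active"}},
    build_out_to_s3_stage("example-bucket", "exports/active", "parquet"),
]
print(json.dumps(pipeline, indent=2))
```

Building the stage as data keeps the format name and the optional region in one place, so switching between Parquet, CSV, TSV, or JSON output changes a single argument.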
