Navigation

Deploy a Data Lake for an Atlas Cluster Data Store

On this page

  • Prerequisites
  • Procedure

This page describes how to deploy a Data Lake for accessing data in an Atlas cluster.

Before you begin, you will need to:

  • Create a MongoDB Atlas account, if you do not have one already.
  • Create an Atlas Cluster, if you do not have one already. Atlas Data Lake supports Atlas clusters deployed to AWS, Azure, or GCP.

    Note

    To use your Atlas cluster as a data store, you must deploy it to the same project as your Data Lake.

  • Add data to at least one collection on your Atlas cluster if you have not already.
1
2
3
  • For your first Data Lake, click Create a Data Lake.
  • For your subsequent Data Lakes, click Configure a New Data Lake.
4
  • For a guided experience, click Visual Editor.
  • To edit the raw JSON, click JSON Editor.
5
6
  1. (Optional) Click the for the:

    • Data Lake to specify a name for your Data Lake. Defaults to Data Lake[n].
    • Database to edit the database name. Defaults to Database[n].

      Corresponds to databases.[n].name JSON configuration setting.

    • Collection to edit the collection name. Defaults to Collection[n].

      Corresponds to databases.[n].collections.name JSON configuration setting.

    • View to edit the view name.

    You can click:

    • Create Database to add databases and collections.
    • associated with the database to add collections to the database.
    • associated with the collection to add views on the collection. To create a view, you must specify:

      • The name of the view.
      • The pipeline to apply to the view.

        Note

        The view definition pipeline cannot include the $out or the $merge stage. If the view definition includes nested pipeline stages such as $lookup or $facet, this restriction applies to those nested pipelines as well.

      To learn more about views, see:

    • associated with the database, collection, or view to remove it.
  2. Drag and drop the data store to map with the collection.

    Corresponds to databases.[n].collections.[n].dataSources JSON configuration setting.

7
Give Feedback

On this page

  • Prerequisites
  • Procedure