Reindexing in Amazon OpenSearch Serverless with Amazon OpenSearch Ingestion | AWS

Reindexing in Amazon OpenSearch Serverless with Amazon OpenSearch Ingestion | AWS



Amazon OpenSearch Serverless offers a serverless option for managing search and analytics workloads without dealing with infrastructure. Users often need to copy documents between indexes for reasons like reindexing due to data changes or disaster recovery for redundancy. A recent feature in Amazon OpenSearch Ingestion now supports OpenSearch as a source, allowing seamless copying of data between indexes in OpenSearch Serverless collections. This eliminates the need for custom code and streamlines data management tasks.

To copy data between indexes in the same OpenSearch Serverless collection, users must follow a series of steps. This involves creating an IAM role for the OpenSearch Ingestion pipeline, updating the data access policy for the OpenSearch Serverless collection, and creating the OpenSearch Ingestion pipeline to copy data between indexes. Additionally, creating an index template using the pipeline can define explicit mapping rules for stored documents and fields.

By following these steps, users can effortlessly copy data between indexes within OpenSearch Serverless collections. The OpenSearch Ingestion pipeline also enables data transformation using various processors. AWS provides built-in pipeline integrations for ingesting data from various sources like Amazon DynamoDB, Amazon MSK, and more. Users can leverage these resources to build data pipelines with minimal configuration changes.

The authors, Utkarsh Agarwal and Prashant Agrawal, are experts in Amazon OpenSearch Service. They provide technical assistance to customers in building scalable solutions in the AWS Cloud. Utkarsh enjoys watching movies and cooking in his free time, while Prashant helps customers fine-tune their clusters for better performance. When not working, Prashant loves traveling and exploring new places.

Article Source
https://aws.amazon.com/blogs/big-data/perform-reindexing-in-amazon-opensearch-serverless-using-amazon-opensearch-ingestion/