amazon-web-servicesamazon-s3

How do you full text search an Amazon S3 bucket?


I have a bucket on S3 in which I have large amount of text files.

I want to search for some text within a text file. It contains raw data only. And each text file has a different name.

For example, I have a bucket name:

abc/myfolder/abac.txt

xyx/myfolder1/axc.txt

& I want to search text like "I am human" in the above text files.

How to achieve this? Is it even possible?


Solution

  • The only way to do this will be via CloudSearch, which can use S3 as a source. It works using rapid retrieval to build an index. This should work very well but thoroughly check out the pricing model to make sure that this won't be too costly for you.

    The alternative is as Jack said - you'd otherwise need to transfer the files out of S3 to an EC2 and build a search application there.