Expanding the Azure Data Box Family

In a previous blog I introduced Azure Data Box. TodayI’d like to talk about how Microsoft is expanding the Azure Data Boxfamily by introducing you to the Azure Data Box Gateway and the AzureData Box Edge devices.

Until now the Data Box Family has been the disc, the box and theheavy. Each have their own limits for storage but are designed toimprove your way of uploading massive amounts of data into Azure,without having to wait for it to travel across the wire or saturate yourbandwidth (consider that the offline method).

Microsoft learned from customers that they want a better way to synctheir local storage directly with Azure storage for operations likearchival and disaster recovery. Here’s where Azure Data Box Gatewaycomes in.

The Data Box Gateway is a cloud storage gateway device thatresides on premises and sends your image, media and other data directlyto Azure.

  • The Gateway is a virtual machine provisioned in your Hypervisor(VMware or Hyper V) where you write the data directly to this virtualdevice using the NFS or SMB protocols, which it then sends to Azure.
  • One use case for the Data Box Gateway is for things like continuously ingesting massive amounts of data.So, we have a local data source that requires large data amounts andcapacities and we can stream those and sync them directly with our Azurestorage.
  • Another use case would be for a cloud archival of data in a secure and efficient way.If you then think about the incremental data transfer over the networkafter the initial bulk transfer is done using the Data Box of yourchoice for direct tie in to the same Azure storage container that you’reusing for your Data Box.

Azure Data Box Edge is a storage solution that allows you to process data and send it over the network to Azure.

  • Data Box Edge uses a physical device supplied by Microsoft to accelerate the secure data transfer.
  • The device resides on premises in your network stack and you write data to it (also using NFS or SMB.)
  • It is additionally equipped with AI enabled Edge computingcapabilities which help to analyze, process or filter data as it movesto Azure block blob, page blob or Azure files.
  • It has the appropriate chips to process intelligent learning(artificial intelligence, machine learning, deep learning and such).
  • Use cases are for things like pre-processing data. So, wecan analyze data from on premises or IOT devices to get fasterinformation about the data. That pre-processing will allow us to dothings like aggregating your data before it gets sent to Azure ormodifying data, such as taking out PII.
  • You can also subset and transfer the data needed for deeper analytics in the cloud.
  • Additionally, you can analyze and react to IOT events.So, if you’re running IOT devices on prem and you want the ability tobe quicker to respond when those events occur, this is a great way tohandle that.
  • Another great use case is you can run machine learningmodels to get quick results that can be acted on before the data is sentto the cloud.
  • With these IOT use cases, you don’t have to wait for thedata to be transmitted over the wire, do any of the munging happening upin Azure and then return results. You can return those results on thefly in real time and react more quickly.
  • Eventually the full data set is transferred to continue and help youto retain and improve any of your machine learning models. You cancontinually feed it data and have those models trained repeatedly, thuslearning to be more concise over time.

The Data Box family is a very cool technology by having an online version to further extend its capabilities.

Previous
Previous

How and Why to Add a Source Code Repository to Azure Data Factory

Next
Next

5 Ways Azure Makes Your Enterprise More Secure