A customer requires a server infrastructure update for its Hadoop HBase solution. The HBase maximum
dataset size will be 120TB.
To host HBase, the architect is planning three HPE Moonshot 1500 chassis fully populated with cartridges.
The populated chassis provide the following in total:
- 135 processors with four 2.9GHz cores each
- 4.23TB RAM
- 64.8TB local storage on SSDs
An aspect of the design is under provisioned and is likely to cause performance to degrade to one-tenth or even one-hundredth of its potential.
How can the architect resolve this problem?
An architect needs to design a solution for a customer that stores all data on third-party NAS storage, and that does not yet have a database solution. The new HPE-based solution must be optimized for data mining.
How can the current data stored on the NAS be used for data mining?