Basis of difference | Hadoop | Teradata |
Cost | Hadoop is less expensive than Teradata because it runs on commodity hardware and needs no software licence. | Teradata is more expensive than Hadoop because it needs paid software licensing and vendor hardware. |
Type of data | Hadoop can store multiple types of data: structured, semi-structured and unstructured. | Teradata can also store multiple types of data: structured, semi-structured and unstructured. |
Processing | Hadoop's processing model is comparatively easier to understand than Teradata's. | Teradata's processing model is comparatively harder to understand than Hadoop's. |
Code Development | Code development in Hadoop takes somewhat more effort because mappers and reducers must be written in a language such as Java or Python. | Coding in Teradata is easier because only SQL queries need to be written. |
Parallel Processing | The workload is divided among the nodes that hold the relevant data, and each node processes its task independently and in parallel, which reduces the overall time taken to complete the job. | In Teradata, the workload is divided across the system and spread evenly among its processors. |
High Scalability | In Hadoop, more nodes or disks can be added when required to increase processing and storage capacity. | Teradata can also be scaled by adding nodes or disks, but doing so increases the licensing cost. |
Multiple Copies | Hadoop keeps multiple copies of the data through HDFS replication (three copies by default). | Teradata can also keep multiple copies of the data, for example through its fallback option. |
Hardware Failure | When a job or node fails, the same job is triggered again on a different node that holds a replica of the data. | In Teradata, if a job fails, the same job can be triggered on a different processor that works from another replica of the data. |
Shared-nothing Architecture | In Hadoop, a task executing on one node is independent of the tasks executing on other nodes. | In Teradata, a task executing in one virtual processor is independent of the tasks in other virtual processors. |
Data Distribution | In Hadoop, data is split into blocks and distributed among the data nodes, taking into account the space available on each node. | In Teradata, a hash is computed over the primary index of a table (often, but not necessarily, its primary key) to distribute the rows evenly across the processors and their disks. |
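
The two data distribution strategies in the last row can be contrasted with a small sketch. This is only an illustration under stated assumptions: `hash_distribute` uses MD5 as a stand-in for Teradata's row hash (the real hashing algorithm is different), `space_based_placement` reduces HDFS block placement to "pick the node with the most free space" and ignores replication and rack awareness, and the node names, block size, and unit count are hypothetical.

```python
import hashlib
from collections import Counter


def hash_distribute(keys, num_units):
    """Map each key to a processing unit by hashing it.

    Illustrates the hash-on-index idea only; MD5 is an assumption,
    not the hash function Teradata actually uses.
    """
    placement = {}
    for key in keys:
        digest = hashlib.md5(str(key).encode("utf-8")).digest()
        placement[key] = int.from_bytes(digest[:4], "big") % num_units
    return placement


def space_based_placement(block_sizes, free_space):
    """Place each block on the node that currently has the most free space.

    A deliberate simplification of HDFS placement: no replicas, no racks.
    """
    free = dict(free_space)  # copy so the caller's dict is not modified
    placement = []
    for size in block_sizes:
        node = max(free, key=free.get)
        free[node] -= size
        placement.append(node)
    return placement


# 10,000 hypothetical index values spread over 4 hypothetical units.
rows_per_unit = Counter(hash_distribute(range(10_000), 4).values())

# Six 128 MB blocks placed on three hypothetical nodes (free space in MB).
block_nodes = space_based_placement(
    [128] * 6, {"node-a": 1000, "node-b": 500, "node-c": 400}
)

print(rows_per_unit)
print(block_nodes)
```

With hashing, the location of a row depends only on its index value, so the spread is roughly even regardless of how full each unit is. With space-based placement, the emptiest node receives blocks first, so the layout depends on the state of the cluster at write time.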