Hadoop (and hence HDFS) is a stack of services designed to work together to serve a file system and manage jobs. The Hadoop stack has pluggable authentication/authorization by design. And yes, the default is "no security".

Given its distributed nature, HDFS runs across multiple machines, and on Linux the usual fit for securing a distributed service is Kerberos. So if you want a "secure" HDFS you normally "kerberize" the services such that any Hadoop operation requires a valid, authorized TGT.

To most people, kerberizing a Hadoop cluster is a major barrier to getting Hadoop running. I don't see this changing, though certain vendor Hadoop distros break down some of the barriers.

Sometimes it is OK to run a cluster insecure. Please don't do it if you're handling my financial or medical records, though. As Mr. T once said, "don't write checks that yo ass can't cash".
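For the curious, here's roughly what the client side of "any Hadoop operation requires a valid TGT" looks like once the cluster is kerberized. This is just a sketch: the principal, keytab path, and NameNode address are made up, and the cluster-side services still need their own keytabs and config.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.security.UserGroupInformation;

    public class KerberizedHdfsClient {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            // Placeholder NameNode address; substitute your own.
            conf.set("fs.defaultFS", "hdfs://namenode.example.com:8020");
            // Switch the client from the default "simple" auth to Kerberos.
            conf.set("hadoop.security.authentication", "kerberos");
            conf.set("hadoop.security.authorization", "true");

            UserGroupInformation.setConfiguration(conf);
            // Obtain credentials from a keytab; principal and path are placeholders.
            UserGroupInformation.loginUserFromKeytab(
                    "alice@EXAMPLE.COM", "/etc/security/keytabs/alice.keytab");

            // With valid Kerberos credentials, ordinary HDFS operations work as usual.
            FileSystem fs = FileSystem.get(conf);
            for (FileStatus status : fs.listStatus(new Path("/"))) {
                System.out.println(status.getPath());
            }
            fs.close();
        }
    }

For interactive use the same idea applies: kinit first, then run your hadoop/hdfs commands with the resulting ticket.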
Even if node-to-node communication in a cluster (Hadoop or otherwise) is not itself secured, is it not reasonable to secure external access to the cluster (e.g. with a firewall)?

From an outsider's perspective (I've never used or run Hadoop) I can't see much reason for exposing the cluster to the outside world: either a web app acts as an intermediary, or access can be provided via VPN/ssh-tunnel/etc.

... just curious why a fully/publicly exposed cluster would be a "requirement"? Or does it come down to firewalling an AWS environment being as painful as (if not more painful than) "kerberizing" a [Hadoop] cluster? (I kind of assumed AWS has firewalling functionality that is fairly plug'n'play ... a quick search seems to back that up, though.)
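For what it's worth, the plug'n'play firewalling I had in mind is security groups. A rough sketch with the AWS Java SDK of allowing the NameNode RPC port only from a private range (the group id, port, and CIDR below are placeholders, not anything HDFS-specific I can vouch for):

    import com.amazonaws.services.ec2.AmazonEC2;
    import com.amazonaws.services.ec2.AmazonEC2ClientBuilder;
    import com.amazonaws.services.ec2.model.AuthorizeSecurityGroupIngressRequest;
    import com.amazonaws.services.ec2.model.IpPermission;
    import com.amazonaws.services.ec2.model.IpRange;

    public class LockDownHdfsPorts {
        public static void main(String[] args) {
            AmazonEC2 ec2 = AmazonEC2ClientBuilder.defaultClient();

            // Allow the NameNode RPC port (8020 here) only from a private CIDR,
            // e.g. the VPN or application subnet.
            IpPermission namenodeRpc = new IpPermission()
                    .withIpProtocol("tcp")
                    .withFromPort(8020)
                    .withToPort(8020)
                    .withIpv4Ranges(new IpRange().withCidrIp("10.0.0.0/16"));

            ec2.authorizeSecurityGroupIngress(new AuthorizeSecurityGroupIngressRequest()
                    .withGroupId("sg-0123456789abcdef0")
                    .withIpPermissions(namenodeRpc));
        }
    }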
I knew it was a bad idea to post 'getting started' tutorials that skip all the security steps and replace them with a 'probably don't wanna do it this way in production' (and usually no documentation on how one should do it)...

Not levelling this comment at HDFS solely, but it's about time people stopped with the 'hello world' style examples.