If you already tried coding it yourself, it would help if you told us what you tried so we know exactly where you have problems.
The basics of what I would do is to code the movement/collision system as though it were a regular up/down/left/right top-down thing, then code an isometric view for it. So if a character moved up in the movement system (increasing the y value) then the character in the view would move something like left-up (increasing the y value, decreasing the x value). It's a lot simpler to deal with collisions and visualize things and map things this way, since you don't have to constantly deal with diagonals. If you know how to code a 2D Zelda movement/collision system after setting that up, then you're pretty much set. If not, tell us!
Really nice music, by the way. I think it's a given that whatever game you make, it'll at least sound great.
EDIT: Seriously, I didn't know that chiptune hip hop could sound this good.