Hi asmi - you've previously made dev boards for Xilinx FPGAs haven't you?
Yes I did, and not just devboards.
I'm starting to need something bigger than the 10M50 in the DECA, so I'm basically looking to create a dev board with a similar form-factor to the DECA board - specifically the Beagle-bone compatible headers - so I can just plug it directly into my DIY computer and start testing. From a quick browser just now, the Xilinx Artix XC7A75T is looking like a likely candidate - 1mm ball pitch on the FTG256 package means I'm far more likely to get all those I/O's routed out as well, even with only a 4-layer board. I want to be able to test the GPU HDL in a different architecture from the existing Intel/Altera hardware we're using.
The biggest problem is going to be sourcing the part. Right now as far as I can tell, LCSC is the only place I know where you can find some Artix parts in stock. Unless you want to try your luck with the likes of Aliexpress - you can find pretty much any Artix-7 part there, but the prices are very random - from super-attractive to outright ridiculous, and of course the quality is unknown. Though I personally bought several parts off Ali and those I actually used were working OK (mostly reballs, which is expected considering the price), but there is always a chance for a dud. I guess for a personal/one-off/hobby project that shouldn't be a problem.
Also with recent drop in prices there is no longer a reason to limit yourself to 4 layer boards, as 6 layer gives you two extra routing layers and make layout much easier. DDR3 layout is already hard, no reason to make it even harder just to save a few bucks - unless you don't value your time at all.
As for your project, I would actually recommend going for a FGG484 package - it will allow implementing a 32 bit DDR3@400 MHz interface to give you additional room to grow in terms of memory bandwidth, it also has 4 multi-gigabit transceivers in case you want to implement some kind of serial interface - like SATA, PCI Express (say for NVMe storage, or some extension slot for future upgrades), DisplayPort, high-bandwidth HDMI (above 1080p@60), or something else. Speed grade 2 devices and higher can go as high as 6.6 Gbps per transceiver.
Finally, these Artix parts get pretty toasty, so you will need to provision for some sort of heatsink, or maybe even a small fan to help keep things from overheating. Remember, unlike CPUs, FPGAs don't have built-in overheating protection (unless you implement one yourself, that is), and will happily fry themselves if you are not careful.
But it's kind of offtopic in this thread, so if you want to discuss this further, let's find a more suitable venue.