Now that I have completed the Spark/Hadoop stack, I am now looking for data to start piling onto the system. I have Python, R, Scala and some other things ready to use, and I think using the Zeppelin and Jupyter worksheets would be great to manage the procedures.
I have been looking at the Chandra data, and I am trying to get the schema right so I can start using a small section of sky to explore and work some models from. I would love to get the images from the different spectrum gratings and do a stacked image like traditional Photometry. If I can get a solid platform of processing going, I may want to do some interesting calculations to sort out ways to classify and identify interesting things.
I also watched a great 40yr anniversary tour of the VLA in Arizona. There was a particular image I saw in the science presentation, which showed the venn diagram of two things. I remember the vertical axis was known/unknown. However, I am struggling with the horizontal axis, which is driving me crazy. The closest I can get is Data/No Data. On these two axes, you get quarters of research topics and exercises. In data and known you get demographic studies. In data and unknown, you get outliers. I don't think this is right, but that is pretty close. I am going to continue to search for that.
I also tried in vain to get the laptop to control the telescope mount. This was very annoying. I really want to get it working for tracking. However, there are a few more things I need to try before I give up.
I also installed a personal cloud type service, with features similar to Google. You can keep files on one drive, but aslso create new Office documents there as well. I had trouble wioth that part because it requires a separate server to be setup, and a crazy reverse-proxy setup to get it all working on a VirtualBox. Really? no.