Right before we left last night Rohan was telling Kivan and me about how he made this script that would calculate the percentage of the total run time that each leg of the trip was taking up. This would hopefully be a useful tool for us in analyzing ugly rides and seeing which legs were using up a lot of resources as far as time goes. The script is pretty simple, but the only problem was that when double checking the percentages he realized that all the percentages from the different legs in a run didn't always add up to one. So, I told him I would look at it today and see if I could figure out where there might be a bug in the code. I don't want to spoil the ending for you, but I worked on debugging this script all day and (SPOILERS) I still could not figure out why it won't always give the correct percentages. Long story short, sometimes the percentages are incredibly accurate, but other times they are about (but not exactly) half what they should be, which makes it impossible for us to reliably use the data it outputs. It also apparently was dependent for some reason on the length or size of the data set it was given to work on because when given a subset of the same data the percentages would drastically change for reasons unknown. All in all, I don't think this script will be able to help us much, at least not as much as Rohan was hoping, but it was fun for me to play around with it and to try to do some debugging in R. Also in the process I ended up checking again to make sure our leg times script was outputting the correct leg times, which is good and important since those are used to calculate the ugly rides. Speaking of ugly rides, I asked Rohan then while I was working on his script if he would take a look at my ugly rides script and optimize it for efficiency because right now while it works it was a first pass attempt that ended up with four for loops. In the end, both Rohan and Frank worked on it and Frank got a new copy of the script to run faster and produce some output, so we finally had some output telling us which rides are considered ugly rides. It looks like we'll be able to use this to do more analysis delving deeper into specific cases next week.
Our day ended with a presentation by Candace Faber talking about collaboration with different stakeholders and big data. I really liked her presentation because it was interactive and used meaningful and captivating examples to illustrate the points she was trying to get across.
Our day ended with a presentation by Candace Faber talking about collaboration with different stakeholders and big data. I really liked her presentation because it was interactive and used meaningful and captivating examples to illustrate the points she was trying to get across.