Autoquery Data Caching - How to control data refresh?

jrodrigu · May 9, 2019, 11:26pm

This is another question where I feel like I’m missing something obvious, but here goes:

I have an AutoQuery Data (i.e. using QueryDataContext.ServiceSource) endpoint that is storing the results in a Redis Cache. The call to get the data the first time is slow and expensive (can take several minutes), and I can sometimes get multiple requests for new data before the cache can get created (either initially or when the cache expires), enough to negatively impact the server as a whole. I thought about gating the request call via a Redis Lock:

// GetMyData is used as the ServiceSource for AutoQueryDataFeature
public List<MyData> Any(GetMyData request)
{
    using (IRedisClient redis = clientsManager.GetClient())
    {
        using (redis.AcquireLock("ExpensiveCall", TimeSpan.FromMinutes(5)))
        {
            // Expensive call here
        }
    }
}

… which helps manage the overall load (at the cost of some client code timing out), but doesn’t solve the problem of just getting the data once and relying on the cache. Is there a way to update the cache in the background instead through another process, or to serve the blocked clients cache data once I get it?

mythz · May 10, 2019, 12:10am

If you’re using an AutoQuery Data Service there are overloads that accept a Cache Provider and Duration that you want to cache the results for.

Alternatively you can add a [CacheResponse] attribute to your Custom AutoQuery implementation.

mythz · May 10, 2019, 12:32am

These caches don’t block, so you could still have multiple requests before the first cache is primed.

You could use a lock():

static object oLock = new object();

[CacheResponse(Duration = 300)]
public List<MyData> Any(GetMyData request)
{
    lock (oLock)
    {
        //... expensive operation
    }
}

I’d only use redis distributed locking if you have multiple load balanced app servers and you absolutely want the same request across all of them.

The AutoQuery cacheable Memory data source is an alternative strategy: pre-load a disconnected data source and have AutoQuery Services operate off that, i.e.:

.AddDataSource(ctx => ctx.MemorySource(() => 
  $"https://api.github.com/repos/ServiceStack/{ctx.Request.GetParam("repo")}/contributors"
    .GetJsonFromUrl(req => req.UserAgent="AutoQuery").FromJson<List<GithubContributor>>(),
  HostContext.LocalCache, 
  TimeSpan.FromMinutes(5)
));

But that depends on how large the entire dataset you want cached is.

jrodrigu · May 10, 2019, 1:01am

I’m using both CacheResponse (since GetMyData is also used as an endpoint outside of Autoquery), and the cached overload

//Autoquery feature code above
.AddDataSource(ctx => ctx.ServiceSource<MyData>(new GetMyData(),
                        HostContext.Cache, TimeSpan.FromHours(20)))

Using a non distributed lock makes sense since I don’t necessarily need the same request across my load balanced servers… is there any way to serve a cached response to a client blocked by the lock (once the lock is released)? I’m guessing not since caching short circuits the request, and if you are already in the request it’s too late?

mythz · May 10, 2019, 1:29am

You would need to lock around the cache inside your Service, which you can do using the older ToOptimizedResult* APIs, e.g.:

// Returns object because ToOptimizedResultUsingCache returns object
public object Any(GetMyData request)
{
    lock (oLock)
    {
        var cacheKey = "unique_key_for_this_request"; //e.g. Request.RawUrl
        return base.Request.ToOptimizedResultUsingCache(base.Cache, cacheKey, () => 
        {
            //... expensive operation
        });
    }
}
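
To show why locking around the cache lets blocked callers pick up the cached result, here is a minimal sketch in plain C# with no ServiceStack dependency. GatedCache, GetOrLoad and the MyData stand-in are hypothetical names for illustration only, and the in-process dictionary is an assumption standing in for whatever cache client you use:

```csharp
using System;
using System.Collections.Concurrent;
using System.Collections.Generic;

// Hypothetical stand-in for the MyData DTO in this thread.
public class MyData { public int Id { get; set; } }

// Sketch only: check the cache, take the lock, check again, then load.
public static class GatedCache
{
    private static readonly object Gate = new object();
    private static readonly ConcurrentDictionary<string, (DateTime ExpiresAt, List<MyData> Value)> Entries =
        new ConcurrentDictionary<string, (DateTime ExpiresAt, List<MyData> Value)>();

    public static List<MyData> GetOrLoad(string key, TimeSpan ttl, Func<List<MyData>> load)
    {
        // Fast path: a fresh entry is returned without taking the lock.
        if (Entries.TryGetValue(key, out var hit) && hit.ExpiresAt > DateTime.UtcNow)
            return hit.Value;

        lock (Gate)
        {
            // Callers that were blocked re-check here and get the value
            // stored by whoever held the lock before them.
            if (Entries.TryGetValue(key, out hit) && hit.ExpiresAt > DateTime.UtcNow)
                return hit.Value;

            var value = load(); // the expensive call runs once per expiry
            Entries[key] = (DateTime.UtcNow.Add(ttl), value);
            return value;
        }
    }
}
```

The second check inside the lock is the important part: a request that waited on the lock finds the entry already populated and returns it instead of repeating the expensive call. This only gates requests within one process; it does not coordinate across load balanced servers.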

jrodrigu · May 10, 2019, 1:46am

Ok, but would that work with the caching done by the AutoQuery, or do I need to turn off caching there? e.g.

// Don't cache results here, rely on the service to cache data
.AddDataSource(ctx => ctx.ServiceSource<MyData>(new GetMyData(),
                        null, TimeSpan.FromHours(20)))

Or, should I use the in Memory LocalCache in conjunction with ToOptimizedResult (with a shorter expiration to cover clients stuck in this state), and use my regular cache (Redis) with AutoQuery? Something like:

Global.asax:

//Autoquery feature code above
.AddDataSource(ctx => ctx.ServiceSource<MyData>(new GetMyData(),HostContext.Cache, TimeSpan.FromHours(20)))

Service.cs

// Returns object because ToOptimizedResultUsingCache returns object
public object Any(GetMyData request)
{
    lock (oLock)
    {
        var cacheKey = "unique_key_for_this_request"; //e.g. Request.RawUrl
        return base.Request.ToOptimizedResultUsingCache(LocalCache, cacheKey, TimeSpan.FromMinutes(5), () => 
        {
            //... expensive operation
        });
    }
}

mythz · May 10, 2019, 1:55am

This just shows generic caching for a single query (it can only cache the exact same query). If you want to cache all AutoQuery requests, I would just use MemorySource and load the snapshot data source you want all AutoQuery Data requests to query. I’d really avoid trying to do any double-caching.
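
As a rough illustration of the snapshot idea (load the whole dataset once, then let every query run against the in-memory copy), here is a sketch in plain C#. It is not how MemorySource is implemented; Snapshot<T> and Contributor are hypothetical names, and how Refresh gets called (a timer, a background job) is left as an assumption:

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical row type for illustration.
public class Contributor
{
    public string Login { get; set; }
    public int Contributions { get; set; }
}

// Sketch only: holds one fully loaded list that all readers query in memory.
public class Snapshot<T>
{
    private readonly Func<List<T>> load;
    private volatile List<T> current = new List<T>();

    public Snapshot(Func<List<T>> load) { this.load = load; }

    // Build the new list completely, then publish it with one reference assignment.
    public void Refresh() { current = load(); }

    // Readers see either the previous list or the new one, never a partial load.
    public IReadOnlyList<T> Items => current;
}
```

Different queries (for example `snapshot.Items.Where(c => c.Contributions > 10)`) then filter the same snapshot, so there is one cached dataset rather than one cache entry per query. Readers are never blocked while a refresh is in progress; they keep getting the previous list until the new one is swapped in.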

jrodrigu · May 10, 2019, 2:17am

Great, this helps…thanks!