Tag Archives: google

Google Cloud Platform technical qualification training: App Engine

I’ve just spend the week at the Google office in London to attend the CP300 course. I thought a good way to prepare for the certification exams would be to put down my notes on my blog…

Thanks to Ignacio who was leading the training.

The first two days were dedicated to Google App Engine.
App Engine is all about building scalable, reliable and cost effective web application the Google way, it :

Leverages Google CDN to serve static ressources
Use Stateless application server with automatic horizontal scaling
Use a NoSql datastore (you can also connect it to a relationnal database, cloud SQL, if needed)

You can configure the way it scales by tweaking pending latency and the number of idle instances. This will impact the performance and the cost of your application.

Instance on Appengine can stop and start frequently, this means you should avoid framework with long start-up time such as Spring or JPA. For depency injection prefer Guice or Dagger (injection is done at compile time)

There is a status console to check if all Google services runs normally. You can also receive notification about downtime by subscribing to this group.

The App Engine console let you monitor quotas usages, very important, most errors on App Engine append because of quota limitations. Of course you can pay to remove those limits. You set a maximum daily budget to make sure you won’t suffer from a denied of service attack on your credit card !

You can deploy and run in parallel multiple versions of the same app (blue/green deployment out of the box)

The app stat tool let you analyze performances.

Authentication & Authorization

GAE provides a service to handle Authentication & Authorization for you. It will use Google account or an openId provider. You can also integrate GAE with an enterprise SSO solution but it requires a Google Apps for business account.

Authorization to access other google API (calendar, storage, compute, …) is done with OAuth2.0.
You can try service calls and Oauth2.0 in the playground

The Datastore

This is the heart of App Engine, you better understand this if you wan’t your application to run well on App Engine.

The GAE datastore is based on Google BigTable, it provides strong consistency for single row but eventual consistency for multi row level.
Every row contains an entity of a certain kind. An entity has a key and properties, properties can be multi-valued.

An entity can have a parent to form an entity group (a single entity without parent count as an entity group). Entity group are usefull to force strong consistency when writing data.

Data on bigtable is distributed by key, if you specify the key yourself make sure it is random enough to get a good distribution of content on the underlining hardware and better performance.

The DataStore is optimized for read queries. Datastore always use an index to read data. All indexes are sorted and distributed on multiple machines.
Queries on the datastore are executed as index scan on bigtable => it’s very fast (the query performance scale with the size of the result not the size of the dataset) but it comes with a few limits:
– You can’t query without an index (indexed can be automaticaly created, beware of their size)
– Queries on multi-valued properties can lead to combinatorial Explosion and big indexes
– Missing properties is not equal to Null/none
– Inequality filter (!=) are limited to one property per query (this is because it is implemented as x< AND x> to use one sorted index)
– no JOIN (use denormalization)
– no aggregation queries (Group by, sum, having, avg, max, min, …) (instead use special entities that maintains counts) see sharding counter pattern
– creating a new index on large set can be long

Indexes are not immediatly updated when writing but ancestor queries force the index update to complete to get strong consistency.

For transaction the datastore use snapshot isolation and optimistic concurrency
Transcation can’t affect more than 5 entity groups
Can’t make more than 5 updates per second to an entity group
A transaction can’t take more than 60 seconds

Memcache, TaskQueue, Cron

GAE provides Memcache as a service to improve performance and reduce application cost. A memcache query can be ten times faster than a datastore query. Memcache can be used as a read/Write cache to the datastore

GAE provides a taskQueue service to executed asynchronous work :

push queues are managed by App Engine
pull queues are manually managed

Tasks can by enqueued in a transaction but will execute outside the transaction
A task is a GET or POST request
GAE execute as many task as possible following the token bucket algorithm
there is
– a bucket size (maximum number of tasks that can be launch at once)
– a token refresh rate (how fast the bucket replenish)
– a maximum number of concurrent requeste

If a task failed it will be re-tried according to the retry policy.
There is a 10 minutes execution limit on front-end instance (instead of 1 min for synchronous requests)

GAE also provides a cron service that you can configure in an xml or yaml file.

My schedule for Google I/O 2014

This year Google I/O will be about DDD, have you read the blue book ? Oh wait, sorry, it’s not about Domain Driven Design but Design, Develop, Distribute. Interesting to see that Google choose to replace the more common “Run” theme with a Distribute one. It feels like they are saying don’t worry anymore about how you will run your application, just use our cloud. But instead think how you will Distribute your mobile application… on google store of course.

We can expect a lot of announcements and sessions around Android. And a lot more, as you can see in the list of sessions I’m planning to attend. Can’t wait to learn more about Docker, Polymer, DevOps and the Google cloud platform !

Day one June 25

9AM (18H in Paris) 2 hours of keynote, can’t miss that !
Announcements, goodies,…
11AM Developer workflow around Docker containers
Great one of my favorite subject to get started
12PM Containers on Google App Engine
Yeah I stay focused on distributing Docker images
1PM Polymer and the Web Components revolution
Let’s switch from container to components
2PM Polymer and Web Components change everything you know about Web development
And stay focused again !
2PM Zero to hero with Google Cloud Platform
Unless I go back the cloud platform
2PM Building a Lambda Architecture in 10 minutes with BigQuery, CEP and Docker
Or more docker
3PM Continuous integration with Google Cloud Release Pipelines
Let’s see how it compares to cloudbees
4PM Unlock the next era of UI development with Polymer
yes many sessions about polymer

Day 2 June 26

9AM Building a web app on App Engine
Using the Go langage !
10AM Devops power tools
10AM Authentication and third party API
Oauth2 early in the morning, is that a good idea ?
11AM Upgrading the engine mid-flight: How Google improves its web apps without downtime
Remember this
12PM Prototyping with Polymer
More Polymer…
2PM Taming your cloud applications with intelligent monitoring
3PM DevOps at the speed of Google
4PM Predicting the future with the Google Cloud Platform

Et en attendant…

Je me promène

Démo google Wave

Grâce à ces 5 astuces et outils pour Google Wave je peux enfin vous montrer à quoi ça ressemble

Hum cette Wave est publique mais il semblerait qu’il faille quand même un compte google wave pour la voir, désolé.

[wave id=”googlewave.com!w+rnoO2JokI”]

Google Web Elements

Toujours plus simple, telle pourrait être la devise de google.

Un copier/coller m’a suffit pour intégrer une recherche google limité à ce site, démo en bas à gauche.

D’autres Web elements sont disponibles.

Google data center

in the time it takes to do a Google search, your own personal computer will use more energy than we will use to answer your query

Google datacenters

Aurélien Pelletier

Web,Open source, Agile, Architecture, Innovation

Tag Archives: google

Démo google Wave

Google Web Elements

Google data center